Abstract

We describe the extension of the improvement program for bilinear operators composed of Wilson
fermions to non-degenerate dynamical quarks. We consider two, three and four flavors, and both
flavor non-singlet and singlet operators. We find that there are many more improvement coefficients
than with degenerate quarks, but that, for three or four flavors, nearly all can be determined by 
enforcing vector and axial Ward identities. The situation is worse for two flavors, where many 
more coefficients remain undetermined. 
I. INTRODUCTION



Simulations of lattice QCD with light dynamical quarks are greatly facilitated by the use 
of improved actions and operators. Calculations are underway using various types of im- 
proved fermions — staggered, Domain Wall/overlap, maximally twisted and improved Wilson 
fermions. Here we focus on improved Wilson fermions, and investigate how the improvement 
program can be extended to remove errors proportional to $a m_q$ ($a$ is the lattice spacing, and
$m_q$ a generic quark mass) in the realistic case of non-degenerate quark masses. Of particular
interest are the "$N_f = 2+1$" theories with $m_d = m_u < m_s$ and "$N_f = 2+1+1$" theories
with $m_d = m_u < m_s < m_c$, and we consider both theories here. As we review below, the
improvement of the action for such theories including $a m_q$ terms has already been considered,
but the improvement of operators has not. As a first step, we consider here the improvement
of all quark bilinears. These are of considerable phenomenological interest, since their
hadronic matrix elements, combined with experimental results for form factors, determine
elements of the CKM matrix. Errors in such hadronic matrix elements proportional to $a m_s$
and particularly $a m_c$ can be significant, and it is thus important to reduce or remove them.

At present, simulations with staggered fermions are able to reach the smallest dynamical 
quark masses. This comes at the cost, however, of a multiplication of fermion species, and 
the concomitant need to use the fourth root of the fermion determinant, so that unitarity 
can at best be restored in the continuum limit. Wilson fermions have the advantage of 
a straightforward relation to the continuum theory: each lattice fermion gives rise to a 
single continuum flavor. They also come, however, with disadvantages: in their original 
form, the leading discretization errors are of $O(a)$, as compared to $O(a^2)$ for staggered
fermions, and these discretization errors explicitly break chiral symmetry. As explained in
seminal papers by the ALPHA collaboration [1], one can apply the Symanzik improvement
program to Wilson fermions and systematically reduce the errors from $O(a)$ to $O(a^2)$.^1 We
recall that this requires the addition of all dimension five operators to the action that are
consistent with the symmetries of the lattice theory, with their coefficients being determined
by appropriate non-perturbative conditions. These conditions are generically chosen to
enforce a symmetry that is present in the continuum limit but is broken for non-vanishing
lattice spacing. For Wilson fermions, the broken symmetries that are used are the flavor
non-singlet axial symmetries.

^1 Since the gauge action gives rise to errors of $O(a^2)$ we do not consider its improvement here, although our
considerations apply equally well for an improved gauge action, as explained in appropriate places in the
following.

A similar method holds for the improvement of operators.^2 One must add all operators
with the same symmetries having one higher dimension and determine their coefficients by 
applying appropriate "improvement conditions". This assumes that the operators do not 
mix with other operators of lower dimension, which will be true in all but one case here. 

It is possible to simplify this procedure by considering only the improvement of on-shell 
quantities — masses, decay constants, physical matrix elements, etc. This allows one to use 
the equations of motion to reduce the number of higher dimension operators that need be 
considered. On-shell improvement is equivalent to improving correlation functions in which 
the arguments are all separated. Off-shell improvement extends this to correlation functions 
in which some arguments are at the same space-time point, and requires additional contact 
terms. While we will consider only on-shell improvement here, it will turn out that we need 
to understand some of the subtleties of off-shell improvement in order to resolve certain 
puzzles that emerge from our analysis. 

The general discussion of Refs. [1] applies in the presence of two or more dynamical
quarks. Thus we know from that work how to non-perturbatively improve the action, and 
it is now standard to implement this in unquenched simulations. Results are available 
with the Wilson gauge action for two P, 0] three O] and four flavors, and also with an 
improved gauge action [4]. The method of Ref. [1] also allows one to improve the flavor
non-singlet axial current in the chiral limit (and a variant of this method has been applied
for two flavors in Ref. nj), and the methodology has been extended to the improvement of
non-singlet vector and tensor bilinears in the chiral limit [6], [7], [8]. Theoretical discussion
of improvement of non-singlet bilinears away from the chiral limit has been restricted to 
degenerate quarks 0] , or to non-degenerate quarks in the quenched approximation 0, 0] . 

Here we generalize previous work by considering the improvement of bilinears for the
realistic case of non-degenerate quarks. We consider both flavor singlet and non-singlet bilinears;
the addition of flavor singlets is required by the analysis, but is also of phenomenological
interest. We explain how working away from the chiral limit introduces a plethora of new
improvement constants, and then study which of these can be determined by imposing
appropriate axial Ward identities. We find that, for three flavors, nearly all can be determined,
with the situation unchanged for four flavors but worse for two flavors. To determine
the remaining improvement coefficients one must use other methods, e.g. non-perturbative
renormalization 3| , suitably improved , or matching short distance correlation functions
to perturbative expressions 8|.

^2 We use the term "improvement" as shorthand for "$O(a)$ improvement" throughout this paper. We do not
consider the removal of discretization errors proportional to $a^2$ or higher powers.

In this work we need make no specific choice as to how the Ward identities are implemented.
One could use a method based on the Schrödinger functional, or use standard
hadron correlation functions. Our theoretical discussion holds equally well for either choice.
What is key, however, is that one can vary the quark masses independently in a regime
where all have $m_q a \ll 1$, so that effects proportional to $(m_q a)^2$ can be neglected. This
condition is satisfied for the physical strange quark for lattice spacings satisfying approximately
$a < (1\ \mathrm{GeV})^{-1}$, and for the physical charm quark if $a < (4\ \mathrm{GeV})^{-1}$. We stress
that these conditions do not necessarily require the quarks to be light (with "light" meaning
$m_q \ll \Lambda_{\rm QCD}$).

The practical implementation of our method will be very involved and computationally 
expensive. An indication of this is that even the simplest step of determining the improve- 
ment constant for the non-singlet axial current in the chiral limit has only recently been 
undertaken for dynamical fermions j^. For most improvement constants, present calcula- 
tions instead rely upon one-loop perturbative values. This raises the question of whether 
our analysis is of purely theoretical interest or will be useful in practice. To ask this another 
way, will a tree-level or one-loop perturbative estimate of the improvement constants suffice 
in practice? For example, the mass dependent improvement coefficient for the non-singlet
flavor off-diagonal axial current (needed to determine decay constants) enters in an overall
factor of $1 + a b_A (m_j + m_k)/2$. If we use the tree level value of $b_A = 1$, then the error in this
factor is $\sim \alpha_s a m_s/2$, assuming a one loop correction to $b_A$ of order unity times $\alpha_s$. Taking
$m_s \approx 0.1$ GeV, $a^{-1} \approx 2$ GeV, and $\alpha_s \approx 0.3$, the error is less than 1%. This may be smaller
than other sources of error, in which case the tree level value for $b_A$ would suffice. On the
other hand, if we consider $f_D$ in the four flavor theory, then the corresponding error is much
larger, $\sim 10\%$, and a more accurate determination of $b_A$ is likely needed.
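The arithmetic behind these two estimates can be sketched in a few lines. This is only an illustration: the function name is hypothetical, the unknown one-loop coefficient is simply set to one, and the charm mass of 1.3 GeV is an assumed input that does not appear in the text.

```python
def improvement_factor_error(alpha_s, a_inverse_gev, mass_sum_gev):
    """Rough size of the term neglected in 1 + a*b_A*(m_j + m_k)/2 when b_A
    is set to its tree-level value 1 and the one-loop shift of b_A is taken
    to be 1 * alpha_s (the order-unity coefficient is an assumption)."""
    a_gev_inv = 1.0 / a_inverse_gev  # lattice spacing in GeV^-1
    return alpha_s * a_gev_inv * mass_sum_gev / 2.0


# Strange-light current: m_j + m_k ~ m_s, with the light mass neglected.
err_strange = improvement_factor_error(alpha_s=0.3, a_inverse_gev=2.0, mass_sum_gev=0.1)

# Charm-light current: m_c ~ 1.3 GeV is an assumed, illustrative value.
err_charm = improvement_factor_error(alpha_s=0.3, a_inverse_gev=2.0, mass_sum_gev=1.3)

print(f"strange: {err_strange:.2%}, charm: {err_charm:.2%}")
```

With these inputs the strange-quark estimate stays below the percent level, while the charm estimate is roughly ten times larger, in line with the argument above.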

The outline of this paper is as follows. In the next section we recall previous work on 
the improvement of the unquenched theory in the chiral limit. In section III we describe
the additional improvement coefficients that are needed in the unquenched theory for non-vanishing
quark masses. We then, in section IV, lay out the Ward identities that can be
used to determine most of these coefficients. We end by discussing some implications of our
results in section V. Two appendices present the generalizations to two and four flavors.

This paper is an expansion, clarification and, to some extent, a correction of Ref. [12].
In that work we argued that some of the improvement coefficients could be determined by
imposing vector Ward identities. It turns out that this was not correct in all cases, due to
the presence of certain contact terms. We explain this point in a final appendix.

A brief summary of the present work has been given in Ref. 



II. REVIEW OF PREVIOUS WORK 



We begin by reviewing previous work on non-perturbative $O(a)$ improvement of unquenched
QCD. The ALPHA collaboration [1] has shown how on-shell improvement of the
action can be accomplished by adding the Sheikholeslami-Wohlert or "clover" term, with
appropriately chosen coefficient $c_{SW}$. Their method requires a flavor non-singlet axial current,
and thus works if the number of light quarks, $N_f$, is two or greater. Of course, the
resulting value of $c_{SW}$ depends on $N_f$.

We briefly review the method of Ref. [1], both for completeness and to introduce our
notation. One considers matrix elements of the improved axial current and pseudoscalar 
density, which, in the chiral limit, take the form 

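$\hat A_\mu^{(jk)} = Z_A A_\mu^{(jk),I}$ , $A_\mu^{(jk),I} = A_\mu^{(jk)} + a c_A \partial_\mu P^{(jk)}$ $(j \neq k)$ , (1)

$\hat P^{(jk)} = Z_P P^{(jk),I}$ , $P^{(jk),I} = P^{(jk)}$ $(j \neq k)$ . (2)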

Here we introduce the notation that we use throughout this article: a "hat" on an operator
indicates that it is both $O(a)$ improved and properly normalized, while the superscript "I"
indicates improvement alone. The improvement here is on-shell, not off-shell. Flavor indices
are shown as superscripts, with $j, k = 1, \dots, N_f$. $A_\mu$ and $P$ are ultra-local lattice transcriptions
of the axial current and pseudoscalar density respectively. The simplest choices are exemplified
by $A_\mu^{(jk)}(x) = \bar\psi^j(x)\gamma_\mu\gamma_5\psi^k(x)$, with $a^{3/2}\psi(x)$ being the bare lattice fermion at site
$x$, but our considerations hold for any ultra-local choices. Finally, $\partial_\mu$ is an $O(a)$ improved
lattice derivative, e.g. the symmetric difference divided by $2a$. Factors of the lattice spacing
$a$ are shown explicitly throughout, so that all quantities have the same dimensions as their
continuum counterparts.

To determine $c_{SW}$ one enforces the simplest axial Ward identity

$\langle \partial_\mu \hat A_\mu^{(jk)}(x) \rangle_J = (\hat m_j + \hat m_k) \langle \hat P^{(jk)}(x) \rangle_J + O(a^2)$ . (3)

Here the $\hat m_i$ are improved and normalized quark masses, whose relation to the bare quark
masses is discussed below. The subscript on the expectation values indicates that these
matrix elements are to be evaluated in the presence of sources, $J$, which are located at
different positions from the operators. The sources should have quantum numbers such that
the result is non-vanishing, but are otherwise arbitrary (in both form and position). They
could be boundary sources in the Schrödinger functional, or standard hadron operators in a
traditional large volume calculation. We need not (and do not) specify them. What matters
here is that this equation should hold for any such sources (which thus create states with the
appropriate quantum numbers in all possible linear combinations). The left- and right-hand
sides will only match as the sources are changed (while holding the bare quark masses and
couplings fixed so that the $\hat m_i$ are fixed) if both $c_{SW}$ and $c_A$ are chosen correctly.^3 Since the
accuracy of matching is $O(a^2)$, these constants can only be determined to a relative accuracy
of $O(a)$.

An important point concerning the implementation of eq. (3) is that one does not need
to know $Z_A$, $Z_P$ or the $\hat m_i$. One need only require that the ratio of the matrix elements of
the improved, but not normalized, quantities, $\langle \partial_\mu A_\mu^{(jk),I} \rangle_J$ and $\langle P^{(jk),I} \rangle_J$, is the same for any
choice of $J$. One can then extrapolate the resulting values of $c_{SW}$ and $c_A$ to the chiral limit.
This limit can be determined by setting all $N_f$ bare quark masses equal, and extrapolating
to the common value for which the right-hand side of eq. (3) vanishes. One then has in hand
the desired improvement constants, which will depend, for a given choice of gauge action,
only upon the bare coupling constant $g_0^2$.^4
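With $c_{SW}$ held fixed, requiring the ratio to be the same for two different sources gives a linear equation for $c_A$, because $\partial_\mu A_\mu^{I} = \partial_\mu A_\mu + a c_A \partial_\mu \partial_\mu P$. The sketch below illustrates that step on synthetic numbers. The function names, the tuple layout and all input values are hypothetical, and a real determination would use measured correlation functions with statistical errors.

```python
def solve_c_A(a, source_1, source_2):
    """Solve for c_A such that (dA + a*c_A*d2P)/P is the same for both sources.

    Each source is a tuple (dA, d2P, P) holding matrix elements of
    d_mu A_mu, d_mu d_mu P and P evaluated with one choice of source J.
    """
    dA1, d2P1, P1 = source_1
    dA2, d2P2, P2 = source_2
    numerator = dA2 / P2 - dA1 / P1
    denominator = a * (d2P1 / P1 - d2P2 / P2)
    if denominator == 0.0:
        raise ValueError("these two sources give no sensitivity to c_A")
    return numerator / denominator


def synthetic_source(a, c_A, two_m, P, d2P):
    """Build fake data that satisfy dA + a*c_A*d2P = two_m * P exactly."""
    dA = two_m * P - a * c_A * d2P
    return (dA, d2P, P)


a = 0.1  # lattice spacing in arbitrary units (illustrative)
src1 = synthetic_source(a, c_A=-0.05, two_m=0.2, P=1.0, d2P=0.5)
src2 = synthetic_source(a, c_A=-0.05, two_m=0.2, P=2.0, d2P=-0.3)
c_A_fit = solve_c_A(a, src1, src2)
print(c_A_fit)
```

The solver needs no knowledge of the normalization constants or of the quark masses, which is the point made in the paragraph above.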

^3 In practice, one can cancel the contributions proportional to one or other of these constants by taking
appropriate linear combinations using different sources. This has been done in the determination of $c_{SW}$
in Refs. Q, |^ Q , and of $c_A$ in Ref. ^ . This allows one to tune the sources to improve the sensitivity
separately for each improvement constant.

^4 In practice, it may be advantageous not to extrapolate $c_{SW}$ and $c_A$ to the chiral limit. There would

At the same time, one has determined the critical value of the hopping parameter, $\kappa_c(g_0)$, as
this is the value for which the right-hand side of eq. (3) vanishes given degenerate quarks.
One can then define bare quark masses in the standard way:

$a m_j = \frac{1}{2\kappa_j} - \frac{1}{2\kappa_c}$ . (4)

These calculations also give information about $Z_A/Z_P$, but we postpone discussion of this
until we have set up the tools to work away from the chiral limit.
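As a small illustration of the bare mass definition, the following sketch assumes the standard Wilson-fermion relation $a m_j = 1/(2\kappa_j) - 1/(2\kappa_c)$. The function name and the numerical hopping parameters are hypothetical.

```python
def bare_mass_lattice_units(kappa, kappa_c):
    """Bare quark mass in lattice units, a*m = 1/(2*kappa) - 1/(2*kappa_c)."""
    return 0.5 * (1.0 / kappa - 1.0 / kappa_c)


# Illustrative values only: a critical hopping parameter and three flavors.
kappa_c = 0.1350
kappas = [0.1349, 0.1349, 0.1340]  # two light flavors and one heavier flavor
a_m = [bare_mass_lattice_units(k, kappa_c) for k in kappas]
print(a_m)
```

A hopping parameter below $\kappa_c$ gives a positive bare mass, and the mass vanishes at $\kappa = \kappa_c$.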

The method has been extended, in the chiral limit, to other bilinears. On-shell improvement
requires addition of all dimension four operators with appropriate quantum numbers:

$V_\mu^{(jk),I} = V_\mu^{(jk)} + a c_V \partial_\nu T_{\mu\nu}^{(jk)}$ , (6)
$T_{\mu\nu}^{(jk),I} = T_{\mu\nu}^{(jk)} + a c_T [\partial_\mu V_\nu^{(jk)} - \partial_\nu V_\mu^{(jk)}]$ , (7)

where $j \neq k$. Each of these operators also has an associated normalization constant. The
improvement constants $c_V$ and $c_T$ can be determined by enforcing appropriate axial Ward
identities in the chiral limit, making use of the previous determination of the improved
axial current. In particular, enforcing that the vector and axial currents rotate into one
another under an axial variation determines $c_V$ (see Refs. jsl Q, 0] for particular methods),
while enforcing that $T_{\mu\nu}$ rotates into other components of itself determines $c_T$.^5

Up to this point we have considered only flavor off-diagonal bilinears, i.e. those with flavor
indices satisfying $j \neq k$. For the subsequent analysis we will need to consider also diagonal
flavor non-singlet operators (as well as flavor singlets). A convenient common notation for all
non-singlet bilinears is $\mathrm{tr}(\lambda O)$, where the trace is over flavor indices, and $\lambda$ is one of the eight
Gell-Mann matrices for $N_f = 3$ (our primary focus), the Pauli matrices for $N_f = 2$, and the
appropriate generalization for SU(4). In the chiral limit, where the SU($N_f$) flavor symmetry
is unbroken, the improvement of the off-diagonal non-singlets described above carries over 
also to the diagonal non-singlets, with the same improvement coefficients. Thus, for example,
the general improved non-singlet axial current is

$\mathrm{tr}(\lambda A_\mu)^I = \mathrm{tr}(\lambda A_\mu) + a c_A \partial_\mu \mathrm{tr}(\lambda P)$ , (8)

for all choices of $\lambda$.

^4 (continued) then be residual $O(am)$ contributions in these coefficients, but this does not affect $O(a)$ improvement.
We find it conceptually simpler, however, to imagine that these coefficients have been extrapolated to the
chiral limit, so that they are independent of the quark masses that we will be varying in the subsequent
discussion.

^5 Our convention for the tensor is $T_{\mu\nu}^{(jk)}(x) = \bar\psi^j(x)\, i\sigma_{\mu\nu}\, \psi^k(x)$ with $\sigma_{\mu\nu} = \frac{i}{2}[\gamma_\mu, \gamma_\nu]$.

When one moves away from the chiral limit, many new improvement constants are needed.
Consider first the gluon action. Possible higher dimension gluonic operators are of dimension
six, so the only contribution at $O(a)$ is the original action-density multiplied by the trace of
the quark mass matrix, $M$. This leads to an effective gluonic coupling constant [1]

$g_0^2 \to \tilde g_0^2 = g_0^2 (1 + a b_g \mathrm{tr}M/N_f)$ , (9)

where $b_g$ is a function of $g_0^2$. If we work at fixed bare coupling, and vary $\mathrm{tr}M$, then the
effective coupling will vary, as will the lattice spacing, and so dimensionful quantities will
have $O(a)$ contributions proportional to $a b_g \mathrm{tr}M$. To avoid these, $g_0^2$ must, in principle, be
varied as $\mathrm{tr}M$ changes in such a way that $\tilde g_0^2$ is held constant. This requires determining
the improvement constant $b_g$. Non-perturbative methods for doing so have been proposed
in Ref. ^, and the one-loop perturbative result is given in Ref. [15].

For most of the following discussion we assume that $b_g$ is known, and that whenever we
vary $M$ we do so with $\tilde g_0^2$, and thus $a$, fixed. In fact, we will find that our method provides
an alternative, independent determination of $b_g$. Thus one does not need to rely on the
methods of Refs. flQ.
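The retuning of the bare coupling implied by eq. (9) can be sketched as follows, assuming the form $\tilde g_0^2 = g_0^2 (1 + a b_g \mathrm{tr}M/N_f)$. The function names and the value used for $b_g$ are hypothetical, and $b_g$ is treated as a constant over the small range of couplings involved, which is adequate at this order in $a$.

```python
def effective_coupling_sq(g0_sq, a_tr_M, b_g, n_f):
    """Effective coupling g0_tilde^2 = g0^2 * (1 + b_g * a*trM / N_f)."""
    return g0_sq * (1.0 + b_g * a_tr_M / n_f)


def bare_coupling_sq_at_fixed_effective(g0_tilde_sq, a_tr_M, b_g, n_f):
    """Bare coupling g0^2 that keeps g0_tilde^2 fixed as a*trM is varied."""
    return g0_tilde_sq / (1.0 + b_g * a_tr_M / n_f)


# Illustrative numbers: N_f = 3, a hypothetical b_g, and two values of a*trM.
g0_tilde_sq = 1.0
for a_tr_M in (0.05, 0.15):
    g0_sq = bare_coupling_sq_at_fixed_effective(g0_tilde_sq, a_tr_M, b_g=0.04, n_f=3)
    print(a_tr_M, g0_sq, 6.0 / g0_sq)  # last column is beta = 6/g0^2
```

As $\mathrm{tr}M$ grows with positive $b_g$, the bare coupling must be lowered slightly to hold the effective coupling, and hence the lattice spacing, fixed.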

The improvement of quark masses and bilinear operators away from the chiral limit in 
an unquenched theory has been discussed previously, but only in the case of degenerate 
quarks 1^. We do not recall this work here, since we will generalize it in the following to 
quarks with non-degenerate masses. 



III. ADDITIONAL IMPROVEMENT COEFFICIENTS 

In this section, we describe the new improvement and renormalization coefficients that
are required in order to improve, at $O(a)$, flavor singlet and non-singlet bilinear operators
for general values of quark masses. The reason for the inclusion of flavor singlet bilinears 
will become clear later. 

Consider first flavor singlet operators in the chiral limit. Matrix elements of these operators
have "quark-disconnected" contractions (in which the operator connects to external
fields through gluons) in addition to the usual "quark-connected" contractions present also
for flavor non-singlet operators. It follows that the improvement coefficients $c_A$, $c_V$ and $c_T$
will not be the same as those for non-singlet operators, and thus we denote them $\bar c_A$, $\bar c_V$
and $\bar c_T$ respectively. The differences begin at two-loop, and thus are of $O(g_0^4)$ barring unforeseen
cancellations. Furthermore, some of these operators can "mix" with purely gluonic
operators. Enumerating the available gluonic operators, one finds the following forms for
on-shell improved flavor singlet bilinears:^6

$(\mathrm{tr}A_\mu)^I = \mathrm{tr}A_\mu + a \bar c_A \partial_\mu \mathrm{tr}P$ ; (10)

$(\mathrm{tr}V_\mu)^I = \mathrm{tr}V_\mu + a \bar c_V \partial_\nu \mathrm{tr}T_{\mu\nu}$ ; (11)

$(\mathrm{tr}T_{\mu\nu})^I = \mathrm{tr}T_{\mu\nu} + a \bar c_T [\partial_\mu \mathrm{tr}V_\nu - \partial_\nu \mathrm{tr}V_\mu]$ ; (12)

$(\mathrm{tr}S)^I = a^{-3} e_S + \mathrm{tr}S + a g_S \mathrm{Tr}(F_{\mu\nu}F_{\mu\nu})$ ; (13)

$(\mathrm{tr}P)^I = \mathrm{tr}P + a g_P \mathrm{Tr}(F_{\mu\nu}\tilde F_{\mu\nu})$ . (14)



Here we use "tr" for the trace over flavor indices, and "Tr" for that over color indices. For
later convenience, it is important that the discretized form of $\mathrm{Tr}(F_{\mu\nu}F_{\mu\nu})$ is exactly that
combination of Wilson loops which appears in the gauge action [see eq. (37) below], with
an average over loop positions so as to be centered on the site where the bilinear is placed.
For $\mathrm{Tr}(F_{\mu\nu}\tilde F_{\mu\nu})$ one can pick any local choice, e.g. that based on the clover-leaf.

The mixing of $\mathrm{tr}S$ with the identity operator in eq. (13) was overlooked in Ref. jl^ .
To improve $\mathrm{tr}S$, the coefficient $e_S$ would need to be determined to an accuracy of $a^4$. To
calculate hadronic matrix elements, however, one must subtract disconnected contributions
anyway, and this completely removes the $e_S$ term. Similarly, the $e_S$ contribution can be
canceled when implementing Ward identities by subtracting disconnected contributions, as
we discuss explicitly below. In this way one can avoid the problem except when calculating
the vacuum expectation value, i.e. the quark condensate. To obtain an improved version of
the condensate, one must use another method, as also discussed below.

^6 In Ref. [13] we erroneously concluded that improvement of the tensor bilinear required the inclusion of an
additional gluonic operator, which, however, vanishes identically.

We now turn to improvement away from the chiral limit. We consider explicitly the case of
three light dynamical flavors; the generalizations to two and four flavors are described in the
appendices. We restrict the discussion to a diagonal bare mass matrix with positive entries,
$M = \mathrm{diag}(m_1, m_2, m_3)$, thus avoiding possible phase structure associated with spontaneous
violation of CP [16]. Treating $M$ as a spurion transforming in the adjoint representation of
the flavor SU(3) group, it is straightforward to enumerate the allowed improvement terms
linear in quark masses. For non-singlet operators we find the general form



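$\widehat{\mathrm{tr}(\lambda O)} = Z_O \left[ (1 + a \bar b_O \mathrm{tr}M)\, \mathrm{tr}(\lambda O)^I + \frac{a}{2} b_O\, \mathrm{tr}(\{\lambda, M\} O) + a f_O\, \mathrm{tr}(\lambda M)\, \mathrm{tr}O \right]$ , (15)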



where $O$ is any of the five bilinears. A possible term proportional to $\mathrm{tr}([\lambda, M]O)$ is forbidden
by CP invariance; only the anticommutator of $\lambda$ and $M$ appears. Note that the operators
appearing in the $O(a)$ corrections on the RHS (right-hand side) can be chosen to be improved
or unimproved, the difference being of $O(a^2)$. Here we have left them unimproved for the
sake of brevity, but in some equations below it is more convenient to use improved versions.
This choice has no impact on $Z_O$ because the explicit factors of quark mass do not allow
mixing back with the leading operator $O$.

There is one subtlety that is overlooked in eq. (15). The flavor-diagonal scalar bilinears
(i.e. those for which the $\lambda$ are diagonal matrices) can also mix with the identity operator.
For example, the operator $S^{(jj)} - S^{(kk)}$ mixes with the identity with a coefficient proportional
to $(m_j - m_k)/a^2$. As for the flavor singlet scalars, however, this mixing is removed in all
but the vacuum expectation value by subtracting disconnected contributions, and so we will
keep its contribution to $S^{(jj),I} - S^{(kk),I}$ implicit in the following.

Aside from this subtlety, there are, for each of the five non-singlet bilinears, three improvement
coefficients $\bar b_O$, $b_O$ and $f_O$, in addition to the overall normalization in the chiral
limit, $Z_O$. It is useful to understand the dependence of each of these quantities on the
coupling constant and lattice spacing. In the chiral limit, the normalization depends both
on the bare coupling and, if the corresponding bilinear has an anomalous dimension, on $a\mu$
(with $\mu$ the renormalization scale): $Z_O = Z_O(g_0^2, a\mu)$. As discussed in Ref. [1], away from
the chiral limit one must replace $g_0^2$ with $\tilde g_0^2$ of eq. (9) so that, in general, $Z_O = Z_O(\tilde g_0^2, a\mu)$.
The improvement coefficients do not, however, depend explicitly on $a\mu$, and so are functions
only of $\tilde g_0^2$, or equivalently of only $g_0^2$ at this order of improvement.

To understand the significance of each of the improvement coefficients, it is useful to 
consider special cases. For flavor off-diagonal operators (which we will also refer to as 
"charged" operators) $f_O$ drops out:

$\hat O^{(jk)} = Z_O [1 + a \bar b_O \mathrm{tr}M + a b_O m_{jk}]\, O^{(jk),I}$ , (16)

where $m_{jk} = (m_j + m_k)/2$. The fact that $\bar b_O$ multiplies the trace of $M$ (and thus depends
on all three quark masses) indicates that it arises from mass dependence of quark loops,
and thus begins at two-loop order in perturbation theory, and is absent in the quenched
approximation. The $b_O$ term, by contrast, arises from the mass dependence of the valence
quark propagators attached to the operator, and is present also in the quenched approximation.
We have chosen the normalization so as to match that of the standard form used in
quenched applications of improvement:

$\hat O^{(jk)} = Z_O^{\rm Q} [1 + a b_O^{\rm Q} m_{jk}]\, O^{(jk),I}$ . (17)

Note that, although $b_O$ and $b_O^{\rm Q}$ arise from the same underlying effect, they will differ numerically
(for a given choice of $g_0^2$) due to mass-independent contributions from quark loops.
Since these enter first at two loop order, however, one loop results for $b_O$ from Ref. [jj] are
valid also for $b_O^{\rm Q}$.
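The way the flavor traces of the general non-singlet form collapse for a charged operator can be checked numerically. The sketch below is an illustration only: the random matrix `O` is a stand-in for the flavor structure of a bilinear, the masses are arbitrary, and the off-diagonal flavor matrix is an elementary matrix (a complex combination of two Gell-Mann matrices) chosen for convenience.

```python
import numpy as np


def flavor_traces(lam, M, O):
    """Return tr(lam O), (1/2) tr({lam, M} O) and tr(lam M) tr(O)."""
    anticommutator = lam @ M + M @ lam
    return (np.trace(lam @ O),
            0.5 * np.trace(anticommutator @ O),
            np.trace(lam @ M) * np.trace(O))


rng = np.random.default_rng(0)
O = rng.normal(size=(3, 3))          # stand-in for the bilinear in flavor space
M = np.diag([0.01, 0.02, 0.30])      # illustrative non-degenerate masses

lam_charged = np.zeros((3, 3))
lam_charged[0, 2] = 1.0              # connects flavors 1 and 3 only

t_lead, t_b, t_f = flavor_traces(lam_charged, M, O)
m_jk = 0.5 * (M[0, 0] + M[2, 2])     # average of the two valence masses

print(np.isclose(t_b, m_jk * t_lead), np.isclose(t_f, 0.0))
```

The anticommutator term reduces to the average valence mass times the leading trace, and the term involving the singlet operator drops out, which is the content of the charged-operator formula.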

The $f_O$ term enters into the improvement of flavor diagonal (or "neutral") operators, e.g.

$\hat O^{(jj)} - \hat O^{(kk)} = Z_O \left[ (1 + a \bar b_O \mathrm{tr}M)(O^{(jj),I} - O^{(kk),I}) + a b_O (m_j O^{(jj)} - m_k O^{(kk)}) + a f_O (m_j - m_k)\, \mathrm{tr}O \right]$ . (18)

Here, on the left hand side we have made the replacement

$\widehat{(O^{(jj)} - O^{(kk)})} = \hat O^{(jj)} - \hat O^{(kk)}$ , (19)

i.e. we have replaced the improved and normalized version of the operator $(O^{(jj)} - O^{(kk)})$
with the difference of the improved and normalized versions of the individual operators.
Similarly, in the first term on the right hand side, we have used

$(O^{(jj)} - O^{(kk)})^I = O^{(jj),I} - O^{(kk),I}$ . (20)

One might be concerned that there is a subtlety hidden in these replacements, since the
individual operators $O^{(jj)}$ contain flavor singlet parts and, as discussed further below, flavor
singlet and non-singlet operators can have different anomalous dimensions. In fact, the same
issue arises in the continuum. We resolve it by simply defining the individual operators as
appropriate sums of the flavor singlet and non-singlet parts, e.g. with two flavors

$\hat O^{(11)} = \frac{1}{2} \left[ \widehat{(O^{(11)} - O^{(22)})} + \widehat{(O^{(11)} + O^{(22)})} \right]$ , (21)

and similarly for the improved operators. Then the relations in eqs. (19) and (20) are
identities.

Returning to eq. (18), it is clear that the $f_O$ term arises from quark-disconnected contractions
of the operator, because it is only through such contractions that mixing with the
operator $\mathrm{tr}O$, which contains all flavors, can arise. From this we learn that $f_O$ appears first
at two-loop order. We remark that $f_O$ is also present in improvement of quenched bilinears
if one considers flavor-diagonal non-singlets and keeps disconnected contractions. As far as
we know, no such calculations have been done to date.

The presence of an extra improvement coefficient in the diagonal non-singlet operators
as compared to the off-diagonal non-singlets leads to the following apparent paradox. A
non-singlet vector rotation $\delta_V$ that rotates flavor $j$ into flavor $k$ transforms off-diagonal
operators into diagonal ones: $\delta_V O^{(jk)} \propto O^{(jj)} - O^{(kk)}$. Thus the corresponding Ward identity
[given explicitly in eq. (C8)] should be enforced for improved operators on the lattice up
to $O(a^2)$ corrections. Assuming that $\delta_V S$ is known, this identity relates an operator with
two improvement coefficients ($\bar b_O$ and $b_O$) to an operator with one more improvement coefficient
($f_O$). Thus it seems to imply that $f_O$ is not independent. This would be paradoxical,
because $f_O$ is allowed in the first place by the vector symmetry which is subsequently leading
to the constraint. This paradox is resolved by analyzing the possible contact terms in
Appendix C.

Previous discussions of improvement in the unquenched theory have considered only
degenerate quarks. To make contact with the notation used in these papers, we note
that the general result (15) reduces to

$\widehat{\mathrm{tr}(\lambda O)} = Z_O [1 + a (b_O + N_f \bar b_O)\, m]\, \mathrm{tr}(\lambda O)^I$ , (22)

for degenerate quarks. For charged bilinears, this has the same form as used in these papers,
except that what we call $b_O + N_f \bar b_O$ was denoted simply $b_O$ in those works. We prefer our
notation because of its connection with the quenched improvement constants and because
it is more easily generalizable to non-degenerate quarks. We note that at one-loop order the
difference in notation is immaterial because, as noted above, $\bar b_O$ vanishes at this order.

Next we consider the mass-dependent improvement coefficients needed for flavor singlet
operators. We find that the general form consistent with flavor symmetry is

$\widehat{\mathrm{tr}\,O} = Z_O r_O \left[ (1 + a \bar d_O \mathrm{tr}M)\, (\mathrm{tr}\,O)^I + a d_O\, \mathrm{tr}(MO) \right]$ . (23)

There are only two mass-dependent improvement coefficients for each bilinear (as opposed
to the three needed for non-singlets), but there is an additional normalization factor, $r_O$,
appearing in the chiral limit. The latter arises because the normalizations of singlet and
non-singlet operators differ, since the former can have quark-disconnected contractions. Rather
than introduce a new normalization constant $\bar Z_O$, we have parameterized this effect with
the ratio $r_O = \bar Z_O/Z_O$. For the axial current ($O = A_\mu$) the anomalous dimension of the
singlet and non-singlet operators differ (the former beginning at two-loop order, while the
latter vanishing to all orders), so that $r_A$ must depend explicitly on $\ln a$ in addition to the
usual dependence on $\tilde g_0^2$. For the four other bilinears the singlet and non-singlet anomalous
dimensions are the same (the anomalous dimensions vanish for all vector currents, and the
quark-disconnected loop diagrams for $S$, $P$ and $T$ vanish by chirality in the continuum).
Thus $r_S$, $r_P$, $r_V$ and $r_T$ do not depend explicitly on $\ln a$, and are functions only of the
effective coupling constants, just like the improvement coefficients. See Ref. for a more 
thorough discussion of this point. 

We also need the expressions for $O(a)$ improved quark masses in terms of the bare quark
masses. Using flavor symmetry and the constraint that the singlet and non-singlet mass
combinations vanish at the same bare mass we find

$\widehat{\mathrm{tr}\,\lambda M} = Z_m \left[ (1 + a \bar b_m \mathrm{tr}M)\, \mathrm{tr}\,\lambda M + a b_m\, \mathrm{tr}(\lambda M^2) \right]$ , (24)
$\widehat{\mathrm{tr}\,M} = Z_m r_m \left[ (1 + a \bar d_m \mathrm{tr}M)\, \mathrm{tr}M + a d_m\, \mathrm{tr}(M^2) \right]$ . (25)

The notation is analogous to that used for the bilinears. Note that there is no separate
"$f_O$-like" term in (24), since such a term, proportional to $\mathrm{tr}(\lambda M)\mathrm{tr}(M)$, can be absorbed
into the $\bar b_m$ term. This reduction in the number of improvement coefficients will play an
important role in the next section, where we will see how these constants are related to those
needed to improve the scalar bilinear. The overall constant $Z_m$ is scale-dependent, but all
other constants, including $r_m$, are not.
Since we restrict ourselves to a diagonal mass matrix, the result (24) is non-trivial only
for diagonal $\lambda$'s. Taking appropriate linear combinations with eq. (25), one can obtain the
result for individual masses:

$\hat m_j = Z_m \left\{ \left[ m_j + (r_m - 1) \frac{\mathrm{tr}M}{N_f} \right] + a \left[ b_m m_j^2 + \bar b_m m_j \mathrm{tr}M + (r_m d_m - b_m) \frac{\mathrm{tr}(M^2)}{N_f} + (r_m \bar d_m - \bar b_m) \frac{(\mathrm{tr}M)^2}{N_f} \right] \right\}$ . (26)



From the $O(1)$ terms we see that $\hat m_j$ vanishes if all bare masses vanish together. This is
by construction. For non-degenerate positive quark masses, however, $\hat m_j$ does not vanish
when $m_j = 0$. This is not an effect which vanishes linearly in $a$, since it remains true even
when $O(a)$ terms are dropped, because of the contribution proportional to $(r_m - 1)\mathrm{tr}M$.
In particular, this effect implies that, at fixed gauge coupling, the pion becomes massless
at a value of the bare up and down quark hopping parameters (assumed degenerate) which
depends (linearly) on the strange quark mass. This is similar to the well-known result that
the partially quenched critical hopping parameter differs from the fully unquenched value
(as discussed, e.g., in Ref. j3]).
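The individual improved masses can be evaluated numerically by combining the non-singlet and singlet mass relations. The sketch below assumes the combination $\hat m_j = Z_m\{[m_j + (r_m - 1)\mathrm{tr}M/N_f] + a[b_m m_j^2 + \bar b_m m_j \mathrm{tr}M + (r_m d_m - b_m)\mathrm{tr}(M^2)/N_f + (r_m \bar d_m - \bar b_m)(\mathrm{tr}M)^2/N_f]\}$. The function name and every numerical coefficient are hypothetical and chosen only to make the effect visible.

```python
import numpy as np


def improved_masses(m, a, Z_m, r_m, b_m, bbar_m, d_m, dbar_m):
    """Improved, normalized masses for a diagonal bare mass matrix.

    m is the list of bare masses; a is the lattice spacing in inverse units
    of m. All coefficients are inputs (hypothetical values below).
    """
    m = np.asarray(m, dtype=float)
    n_f = m.size
    tr_M = m.sum()
    tr_M2 = (m ** 2).sum()
    order_one = m + (r_m - 1.0) * tr_M / n_f
    order_a = (b_m * m ** 2
               + bbar_m * m * tr_M
               + (r_m * d_m - b_m) * tr_M2 / n_f
               + (r_m * dbar_m - bbar_m) * tr_M ** 2 / n_f)
    return Z_m * (order_one + a * order_a)


# Massless up and down quarks with a massive strange quark (GeV, a in GeV^-1).
coeffs = dict(Z_m=1.0, r_m=1.1, b_m=-0.5, bbar_m=0.05, d_m=-0.4, dbar_m=0.02)
m_hat = improved_masses([0.0, 0.0, 0.1], a=0.5, **coeffs)
print(m_hat)
```

With $r_m \neq 1$ the improved light masses come out non-zero even though the bare light masses vanish, which is the shift of the critical point described above.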

It is useful to consider which of these constants survive in the quenched approximation.
The constants $\bar b_m$ and $\bar d_m$ do not, since they are produced by quark loops, as shown by the
fact that they multiply $\mathrm{tr}(M)$. Similarly, the difference of $r_m$ from unity, and the difference
between $d_m$ and $b_m$, arise from insertions of the mass on quark loops. Thus in the quenched
approximation one only needs two constants: $Z_m$ and $b_m = d_m$. One way of seeing this is to
note that $\hat m_j$ can only depend on $m_j$ in the quenched approximation, which forbids all but
the first and third terms in eq. (26).

In summary, moving from degenerate to non-degenerate quarks and considering singlet
as well as non-singlet operators requires the introduction of a large number of additional
improvement coefficients. All except for $Z_S Z_P$, $Z_T$, $Z_m$ and $r_A$ are scale-independent
functions of the effective coupling alone,^ and for these scale-independent quantities there is no
apparent obstacle to their determination using Ward identities. Indeed, as we show in the
next section, nearly all can be determined in this way.

^ $Z_S/Z_P$ is scale independent, while $Z_S Z_P$ is scale dependent.

IV. DETERMINING COEFFICIENTS USING WARD IDENTITIES



In this section we explain in detail how one can generalize previous methods to determine 
most of the new improvement and normalization coefficients. We organize our approach into 
four steps. This is partly as an explanatory aid, but also because some parts of the later 
steps rely on results from the earlier ones. There remains, however, considerable freedom in 
the ordering of parts of some steps. 

In summary, the four steps are the following. First, we enforce vector charge conservation, 
which determines most of the improvement coefficients, and the normalization constant, of 
the vector bilinear. It turns out that this is the only use to which we can put vector Ward 
identities. In Ref. we had claimed otherwise, but this turns out to be incorrect because 
of overlooked contact terms. We explain this point in Appendix C.

The second step is to relate the improvement and normalization coefficients for the quark 
masses to those for scalar bilinears. This allows us to use, in step three, the simplest axial 
Ward identities such as eq. (jS)) ("two point Ward identities"), to determine combinations 
of coefficients for axial, pseudoscalar and scalar bilinears. Finally, in the fourth step we 
enforce axial Ward identities in which the axial variation occurs in a region including other 
operators ("three point Ward identities"). Here we have to deal with contact terms. 

TABLE I: Relations between renormalization and improvement coefficients for masses and scalar
bilinears. LO and NLO indicate leading and next-to-leading order in quark masses. Results are
valid for $N_f = 3$ and 4. They hold also for $N_f = 2$ if $b_S$ and $b_m$ are set to zero.

Order in M    Relationship or constraint
LO            $Z_S = 1/Z_m$,  $r_S = 1/r_m$,  $g_S = b_g/(2 g_0^2)$
NLO           $b_S = -2 b_m$,  $\bar b_S = -\bar b_m$,  $N_f f_S = 2(b_m - d_m)$,
              $d_S = b_S + N_f f_S$,  $d_S + N_f \bar d_S = -2(d_m + N_f \bar d_m)$

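TABLE II: Normalization and improvement coefficients determined using various Ward identities
(which are denoted schematically) for $N_f = 3$. LO and NLO indicate leading and next-to-leading
order in quark masses. For completeness, we indicate which Ward identities determine $c_{SW}$, $c_V$,
$c_A$ and $c_T$. For the last nine Ward identities (those below the double line), we assume that the
on-shell improved flavor non-singlet axial variation of the action has been determined using the
previous identities (i.e. those above the double line). The notation "Not new" in the final two lines
indicates that these two identities are equivalent, in the chiral limit, to those considered previously.

Ward identity | LO | NLO
------------------------------------------------------------------------------------------
$\langle H|\sum_{\vec x}\hat V_4^{(jj)}|H\rangle = Q_j^H$ | $Z_V$, $r_V$ | $b_V$, $\bar b_V$, $f_V$, $d_V$, $\bar d_V$
------------------------------------------------------------------------------------------
$\partial_\mu \hat A_\mu^{(jk)} = (\hat m_j + \hat m_k)\hat P^{(jk)}$ | $Z_m Z_P/Z_A$, $r_m$, | $b_A$, $f_A$, $b_P - 2b_m$, $b_m + 2 r_m d_m$,
and $\partial_\mu(\hat A_\mu^{(jj)} - \hat A_\mu^{(kk)})$ | $r_P$, $g_P$, | $(1 + 2r_m)^2 b_m + 6 r_m(\bar b_m - \bar d_m)$,
 | $[c_{SW}, c_A]$ | $2(2 + r_m) b_m + 3(\bar b_P - \bar b_A + \bar b_m)$,
 | | $b_P + 2 r_P d_P$, $(2 + r_P) b_P + 6 f_P$,
 | | $r_P(2 + r_P)(\bar b_P - \bar d_P) - (1 + 2r_P + 3 r_m r_P) f_P$
------------------------------------------------------------------------------------------
$\delta_A^{(ij)} T^{(jk)} \to T^{(ik)}$ | $Z_A$, $[c_T]$ | $b_T$, $3\bar b_A - b_A(r_m - 1)$
------------------------------------------------------------------------------------------
$\delta_A^{(ij)} V^{(jk)} \to A^{(ik)}$ | $Z_V$, $Z_A^2$, $[c_V]$ | $b_A + b_V$, $3\bar b_A - b_A(r_m - 1)$,
and $V \leftrightarrow A$ | | $6(\bar b_A - \bar b_V) + (b_A - b_V)(2 + r_m)$
------------------------------------------------------------------------------------------
$\delta_A^{(ij)} P^{(jk)} \to S^{(ik)}$ | $Z_S/Z_P$, $Z_A^2$ | $b_S + b_P$, $3\bar b_A - b_A(r_m - 1)$,
and $P \leftrightarrow S$ | | $6(\bar b_P - \bar b_S) + (b_P - b_S)(2 + r_m)$
==========================================================================================
$\delta_A^{(jk)}\mathrm{tr}A = \delta_A^{(jk)}\mathrm{tr}V = 0$ | $\bar c_A$, $\bar c_V$ | $d_A$, $d_V$
------------------------------------------------------------------------------------------
$\delta_A^{(jk)}\mathrm{tr}P = 2S^{(jk)}$ | $Z_P r_P/Z_S$, $g_P$ | $d_P$, $3(\bar b_S - \bar d_P) - b_S(r_m - 1)$
------------------------------------------------------------------------------------------
$\delta_A^{(jk)}\mathrm{tr}T = 2T^{(jk)}$ | $r_T$, $\bar c_T$ | $d_T$, $3(\bar b_T - \bar d_T) - b_T(r_m - 1)$
------------------------------------------------------------------------------------------
$\delta_A^{(jk)}\mathrm{tr}S = 2P^{(jk)}$ | $Z_S r_S/Z_P$, $g_S$ | $d_S$, $3(\bar b_P - \bar d_S) - b_P(r_m - 1)$
------------------------------------------------------------------------------------------
$\delta_A^{(ij)} T^{(ji)} \to T^{(ii)} + T^{(jj)}$ | $r_T$, $\bar c_T$, $[c_T]$ | $b_T$, $d_T$, $r_T(\bar d_T - \bar b_T) - r_m f_T$
------------------------------------------------------------------------------------------
$\delta_A^{(ij)} S^{(ji)} \to P^{(ii)} + P^{(jj)}$ | $Z_P/Z_S$, $r_P$, $r_S$, | $b_P + b_S$, $2 r_P d_P + b_P$, $2 r_S d_S + b_S$,
and $S \leftrightarrow P$ | $g_P$, $g_S$ | $6(\bar b_P - \bar b_S) + (2 + r_m)(b_P - b_S)$,
 | | $3 r_P(\bar d_P - \bar b_S) + r_P(d_P - b_S) - r_m(3 f_P + b_P - r_P b_S)$,
 | | $3 r_S(\bar d_S - \bar b_P) + r_S(d_S - b_P) - r_m(3 f_S + b_S - r_S b_P)$
------------------------------------------------------------------------------------------
$\delta_A^{(ij)} A^{(ji)} \to V^{(jj)} - V^{(ii)}$ | Not new | $b_V$, $\bar b_V$, $f_V$
------------------------------------------------------------------------------------------
and $V \leftrightarrow A$ | Not new | $b_A$, $\bar b_A$, $f_A$
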
We discuss the $N_f = 3$ theory in this section, summarizing the results in Tables I and
II. The former collects the relationships between constants, while the latter shows which
can be obtained from which Ward identities. The generalizations to $N_f = 4$ and $N_f = 2$ are
discussed, respectively, in Appendices A and B.

A. Vector Ward identities 

We use the standard method of enforcing the correct normalization of the vector charge,

$$
\langle H|\hat Q^j(\tau)|H\rangle = Q_j^H \langle H|H\rangle , \qquad
\hat Q^j(\tau) = \sum_{\vec x} \hat V_4^{(jj)}(\vec x, \tau) , \tag{27}
$$

where $Q_j^H$ is the $j$'th quark number of hadron $H$, for a convenient set of hadrons. In fact,
one does not have to project onto a single hadron: any linear combination created by an
operator with a given $j$'th quark number will work. As we now show, this method can
determine all improvement and renormalization coefficients for the vector current, with the
exception of $c_V$ and $\bar c_V$. These two are excluded because the operators they multiply vanish
at zero spatial momentum.

To see that all other coefficients can be determined we use eqs. (18) and (23) with $O = V_4$
at zero spatial momentum. Consider first degenerate quarks, so that

$$
\hat Q^j - \hat Q^k = Z_V \left[1 + a(3\bar b_V + b_V) m\right] (Q^j - Q^k) , \tag{28}
$$
$$
\mathrm{tr}\,\hat Q = Z_V r_V \left[1 + a(3\bar d_V + d_V) m\right] \mathrm{tr}\,Q , \tag{29}
$$

where $Q^j = \sum_{\vec x} V_4^{(jj)}$ is the bare charge operator. We have dropped the superscript $I$
since the $c_V$ and $\bar c_V$ terms do not contribute. Enforcing the normalization of these two
charges for two or more values of the common quark mass determines $Z_V$, $3\bar b_V + b_V$, $r_V$ and
$3\bar d_V + d_V$. Note that for the singlet charge operator one must use baryonic states, so that the
total charge is non-zero. To obtain the remaining mass-dependent improvement coefficients
non-degenerate quarks are required. It is sufficient, however, to work with a "2+1" flavor
theory, i.e. one in which the up and down quarks are degenerate (with mass $m_1$) but have
a different mass than the strange quark ($m_3$). In this case, the improvement term for the
flavor non-singlet charge $\hat Q^1 - \hat Q^3$ is proportional to

$$
a \left\{ \left[(2\bar b_V + b_V) m_1 + \bar b_V m_3\right] Q^1 - \left[2\bar b_V m_1 + (\bar b_V + b_V) m_3\right] Q^3 + f_V (m_1 - m_3)\,\mathrm{tr}\,Q \right\} , \tag{30}
$$
as can be seen from (18). Thus by varying $m_1$ at fixed $m_3$, and considering different hadrons
so that the contributions of the independent operators $Q^1$, $Q^3$ and $\mathrm{tr}\,Q$ vary, one can
determine $b_V$, $\bar b_V$ and $f_V$ separately.^ Similarly, the improvement term for the singlet charge is
proportional to

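$$
a \left\{ \left[(2\bar d_V + d_V) m_1 + \bar d_V m_3\right] (Q^1 + Q^2) + \left[2\bar d_V m_1 + (\bar d_V + d_V) m_3\right] Q^3 \right\} , \tag{31}
$$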

so that $d_V$ and $\bar d_V$ can be disentangled by varying $m_1$.
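
As a cross-check of the non-singlet improvement term, the following sketch compares two ways of writing it in a 2+1 theory. It assumes that the general $O(a)$ part of $\hat Q^1 - \hat Q^3$ is $\bar b_V\,\mathrm{tr}M\,(Q^1 - Q^3) + b_V (m_1 Q^1 - m_3 Q^3) + f_V (m_1 - m_3)\,\mathrm{tr}Q$ (overall factor $a$ dropped), and that eq. (30) is its rearrangement. The function names, coefficient values and hadron quark numbers are illustrative assumptions.

```python
def direct_improvement(q, m1, m3, bV, bbarV, fV):
    """O(a) part of Q^1 - Q^3 from the assumed general form.

    q = (Q1, Q2, Q3) are the quark numbers of the external hadron.
    """
    masses = (m1, m1, m3)                   # 2+1 theory: m_u = m_d = m1
    tr_m = sum(masses)
    tr_q = sum(q)
    return (bbarV * tr_m * (q[0] - q[2])
            + bV * (masses[0] * q[0] - masses[2] * q[2])
            + fV * (masses[0] - masses[2]) * tr_q)


def eq30_improvement(q, m1, m3, bV, bbarV, fV):
    """The same quantity arranged as coefficients of Q^1, Q^3 and tr Q."""
    tr_q = sum(q)
    coeff_q1 = (2.0 * bbarV + bV) * m1 + bbarV * m3
    coeff_q3 = 2.0 * bbarV * m1 + (bbarV + bV) * m3
    return coeff_q1 * q[0] - coeff_q3 * q[2] + fV * (m1 - m3) * tr_q


# Hypothetical hadrons labelled by (Q1, Q2, Q3): proton-like, kaon-like, Lambda-like.
hadrons = [(2, 1, 0), (1, 0, -1), (1, 1, 1)]
values = [eq30_improvement(q, 0.02, 0.08, bV=1.3, bbarV=0.2, fV=0.1) for q in hadrons]
```

Because the three hadrons weight $Q^1$, $Q^3$ and $\mathrm{tr}\,Q$ differently, measuring the improvement term for each of them at two values of $m_1$ gives enough independent equations to separate $b_V$, $\bar b_V$ and $f_V$.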

In practice, $Z_V$ and $3\bar b_V + b_V$ have been computed for two flavors using this method
(although recall that the latter combination is referred to as $b_V$ in Ref. 2yj). Determination
of $b_V$ and $\bar b_V$ separately should be relatively straightforward, since quark-disconnected
contractions are not required. By contrast, the determination of $r_V$, $d_V$, $\bar d_V$ and $f_V$ requires
such contractions, and will thus be more challenging in practice.^

Vector Ward identities cannot be used to determine any of the other normalization or
improvement coefficients, essentially because the vector symmetries are not broken by the
discretization. We discuss this further in Appendix C because in some cases it is not
immediately obvious why vector Ward identities cannot be used. Indeed, in Ref. [3], we
argued that it was possible to use such identities, and it is instructive to see the flaw in our
argument.



B. Relating improvement of mass and scalar bilinear 

As is well known, the anomalous dimensions of the quark mass and scalar bilinear are
equal in magnitude but opposite in sign, and it is conventional and convenient to choose
their renormalization constants to be the inverse of one another, $Z_S Z_m = 1$. While not
strictly necessary (the right hand side could be a constant other than unity), this choice
implies that derivatives with respect to quark mass give rise to insertions of the (space-time
integral of the) scalar bilinear both for bare and renormalized quantities. In particular, it

^ Determination of $b_V$ and $\bar b_V$ alone can be done using $\hat Q^1 - \hat Q^2$, for which the $f_V$ term vanishes in the 2 + 1
theory.

^ One can show that the quark-disconnected diagrams give vanishing contributions if one uses the charge 
built out of the conserved vector current on the lattice. We can see no argument to extend this result to 
choices of the bare current which are not exactly conserved. 
implies that the useful result

$$
\left.\frac{\partial m_H}{\partial m_j}\right|_{m_k} = V \langle H|S^{(jj)}|H\rangle \tag{32}
$$

(where $|H\rangle$ is the state containing an arbitrary hadron of mass $m_H$ at rest, $V$ is the spatial
volume, and $k$ runs over the flavors different from $j$) holds for both bare and renormalized
quantities.

In this section we apply the condition (32) as a constraint not only in the continuum limit
but also for non-vanishing lattice spacing. The argument for doing so is straightforward: the
quantities appearing in the relation ($\langle H|\hat S^{(jj)}|H\rangle$, $m_H$ and $\hat m_j$) are physical quantities that
should be improved with appropriate choices of the improvement coefficients introduced
above. Thus if the relation (32) holds in the continuum limit it should receive no $O(a)$
corrections once improvement has been implemented. Perhaps surprisingly, this simple
relation leads to a number of non-trivial constraints.

The precise relation we enforce is^

$$
\left.\frac{\partial m_H}{\partial \hat m_j}\right|_{\hat m_k,\, a} = V \langle H|\hat S^{(jj)}|H\rangle . \tag{33}
$$

Here we have been specific about the meaning of the partial derivative in the presence of the
regulator. It should be taken with the regulator (here the lattice spacing $a$) fixed, so that
the relation survives in the limit that the regulator is removed. In order to match the formal
result (32) the derivative should also be taken with the other improved (rather than bare)
masses held fixed. Finally, note that the matrix element of $\hat S$ is, as usual, the connected
matrix element, so that the part of $\hat S$ proportional to the identity [the $e_S$ term in eq. (13)]
does not contribute.

Despite the fact that it involves the a priori unknown quantities $\langle H|S^{(jj)}|H\rangle$, this relation
is useful because we do know how hadron masses depend on bare parameters. In particular,
because the fermion action depends on the bare quark masses only as

$$
S_{\rm lat,F} = a^4 \sum_x \sum_j m_j\, S^{(jj)}(x) + \dots \tag{34}
$$

We stress that all the lattice quantities appearing in this relation are defined to have the same dimension
as their continuum counterparts, e.g. $a m_H$ is the hadron mass in lattice units.
(even after improvement by the addition of the clover term) one can show that [21]

$$
\left.\frac{\partial (a m_H)}{\partial (a m_j)}\right|_{(a m_k),\, g_0^2} = \frac{V \langle H|S^{(jj)}|H\rangle}{\langle H|H\rangle} . \tag{35}
$$

In the derivative on the LHS (left-hand side) the bare coupling and the other bare quark
masses in lattice units are held fixed. As mentioned above, it is the connected matrix element
of the scalar density which appears on the RHS. Similarly, one finds

$$
\left.\frac{\partial (a m_H)}{\partial g_0^2}\right|_{(a m_j)} = -\frac{a V \langle H|\mathrm{Tr}(F_{\mu\nu}F_{\mu\nu})|H\rangle}{2 g_0^4\, \langle H|H\rangle} , \tag{36}
$$

where we define the discretized field strength from the specific form of the gluon action being
used:

$$
S_{\rm lat,G} = a^4 \sum_x \frac{1}{2 g_0^2}\, \mathrm{Tr}(F_{\mu\nu}F_{\mu\nu})(x) . \tag{37}
$$

In this way, our expressions hold for any choice of gluon action.

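To relate the desired derivative in eq. (33) to those which we know, (35) and (36), we
proceed in two stages. First, we relate derivatives with respect to improved masses to those
with respect to bare masses, using the properties of Jacobians:

$$
\left.\frac{\partial m_H}{\partial \hat m_j}\right|_{\hat m_k, \hat m_l}
 = \frac{\partial(m_H, \hat m_k, \hat m_l)}{\partial(\hat m_j, \hat m_k, \hat m_l)}
 = \frac{\partial(m_H, \hat m_k, \hat m_l)/\partial(m_j, m_k, m_l)}{\partial(\hat m_j, \hat m_k, \hat m_l)/\partial(m_j, m_k, m_l)} , \tag{38}
$$
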
where $k$ and $l$ are the two flavor indices not equal to $j$. Here all derivatives are at fixed
$a$, or equivalently at fixed $\tilde g_0^2$. This means that the derivatives such as $\partial \hat m_k/\partial m_j$ can be
straightforwardly evaluated using eq. (26). The renormalization constants $Z_m$ and $r_m$ are
functions of $\tilde g_0^2$ and $a$, and thus are fixed. The same is true, to the order in $a$ that we are
working, of the improvement constants $b_m$ etc., since they can equally well be considered
functions of $\tilde g_0^2$ or $g_0^2$.

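Derivatives such as $\partial m_H/\partial m_j$ cannot yet be evaluated since they are taken at fixed $a$.
Using the relation between $\tilde g_0^2$ and $g_0^2$, one can relate them to the derivatives we know from
eqs. (35) and (36):

$$
\left.\frac{\partial m_H}{\partial m_j}\right|_{a}
 = \left.\frac{\partial (a m_H)}{\partial (a m_j)}\right|_{a}
 = \left.\frac{\partial (a m_H)}{\partial (a m_j)}\right|_{g_0^2}
 - \frac{g_0^2 b_g}{N_f} \left.\frac{\partial (a m_H)}{\partial g_0^2}\right|_{(a m)} . \tag{39}
$$
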



Putting things together we obtain an expression for the derivative in eq. (33) in terms
of matrix elements of the bare scalar density and the gluon field strength, and the
improvement and normalization constants for quark masses. The matrix element in eq. (33)
can be expanded, using eqs. (18) and (23), in terms of the same matrix elements, but with
the coefficients being the improvement and normalization constants for the scalar bilinear.
Matching the coefficients on the two sides gives, after tedious algebra, the relations quoted
in Table I.

We do not go through the details of this calculation in the general case, but do display
the subset of the argument that uses only degenerate quarks. In this case, eqs. (38) and
(39) simplify to


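$$
\frac{\partial m_H}{\partial \hat m}
 = \frac{\partial (a m_H)/\partial (a m)|_{a}}{\partial \hat m/\partial m|_{a}} \tag{40}
$$
$$
 = \frac{\partial (a m_H)/\partial (a m)|_{g_0^2} - g_0^2 b_g\, \partial (a m_H)/\partial (g_0^2)|_{am}}{\partial \hat m/\partial m|_{a}} \tag{41}
$$
$$
 = \frac{V \langle H|\mathrm{tr}S + a (b_g/2 g_0^2)\, \mathrm{Tr}(F_{\mu\nu}F_{\mu\nu})|H\rangle}{Z_m r_m \left[1 + 2a(3\bar d_m + d_m) m\right]} , \tag{42}
$$

where in the last step we have assumed $\langle H|H\rangle = 1$. This should be equated with the
improved matrix element

$$
V \langle H|\mathrm{tr}\hat S|H\rangle = V \langle H| Z_S r_S \left\{ \left[1 + a(3\bar d_S + d_S) m\right] \mathrm{tr}S + a g_S \mathrm{Tr}(F_{\mu\nu}F_{\mu\nu}) \right\} |H\rangle , \tag{43}
$$

where, as noted above, the part of $\mathrm{tr}\hat S$ proportional to the identity operator does not
contribute. We conclude that $Z_S r_S = 1/(Z_m r_m)$, $3\bar d_S + d_S = -2(3\bar d_m + d_m)$ and $b_g = 2 g_0^2 g_S$,
which are a subset of the results in Table I.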

We find that the quenched relations $Z_m Z_S = 1$ and $b_S = -2 b_m$ (Ref. |l^) continue to
hold (although the constants themselves will differ from their quenched values), and that
there are generalizations for some of the new constants that appear ($\bar b_S = -\bar b_m$, etc.). There
are two particularly interesting, and perhaps unexpected, results. First, there is a constraint
on the scalar improvement coefficients ($d_S = b_S + 3 f_S$). This arises because, as noted above,
there is one less improvement constant needed for quark masses than for the scalar bilinear
(i.e. there is no independent $f_m$ term). Second, the relation $b_g = 2 g_0^2 g_S$ provides another
way of determining $b_g$, if we determine $g_S$ using Ward identities as discussed below.

Given the relations of Table I it is interesting to determine if there are any products of
masses times scalar densities which maintain their form under improvement. This does not
hold for the contribution of mass terms to the action itself, i.e.

$$
\sum_j \hat m_j \hat S^{(jj)} \ne \sum_j m_j S^{(jj)} . \tag{44}
$$

The corrections to this equation in the general case of non-degenerate quarks are lengthy
and uninformative, so we quote the result only for the case of degenerate quarks

$$
\left. \hat m\, \mathrm{tr}\hat S \right|_{m_i = m_j = m_k} = m\, \mathrm{tr}S - a(3\bar d_m + d_m) m^2\, \mathrm{tr}S + a g_S m\, \mathrm{Tr}(F_{\mu\nu}F_{\mu\nu}) . \tag{45}
$$

Here we have dropped the term containing the identity operator since it does not contribute
to matrix elements. This degenerate quark result illustrates the general point that there is,
at $O(a)$, mixing with other terms in the action, and thus no reason to expect that each term
in the action should be separately form invariant under improvement. The one example of
form invariance is for the variation of the action under vector transformations:

$$
(\hat m_j - \hat m_k) \hat S^{(jk)} = (m_j - m_k) S^{(jk)} + O(a^2) . \tag{46}
$$

This follows from the definitions (15) and (24) and the relations $Z_S Z_m = r_S r_m = 1$, $b_S = -2 b_m$
and $\bar b_S = -\bar b_m$. It holds for $N_f = 2$-$4$ (and we suspect for all higher $N_f$ as well). This
relation plays an important role in the discussion of vector Ward identities in Appendix C.
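
The form invariance of the vector variation can be checked numerically. The sketch below assumes that the flavor off-diagonal pieces are $\hat m_j - \hat m_k = Z_m (m_j - m_k)[1 + a b_m (m_j + m_k) + a \bar b_m \mathrm{tr}M]$ and $\hat S^{(jk)} = Z_S [1 + a \bar b_S \mathrm{tr}M + a b_S (m_j + m_k)/2]\, S^{(jk)}$, and imposes the Table I relations. The function name and all input values are hypothetical.

```python
def form_invariance_ratio(mj, mk, tr_m, a, Zm, bm, bbar_m):
    """Ratio of (mhat_j - mhat_k) * Shat^(jk) to (m_j - m_k) * S^(jk).

    The scalar constants are fixed by the Table I relations
    Z_S = 1/Z_m, b_S = -2 b_m, bbar_S = -bbar_m (assumed here).
    """
    ZS, bS, bbar_S = 1.0 / Zm, -2.0 * bm, -bbar_m
    mass_diff = Zm * ((mj - mk)
                      + a * bm * (mj**2 - mk**2)
                      + a * bbar_m * (mj - mk) * tr_m)
    scalar_norm = ZS * (1.0 + a * bbar_S * tr_m + a * bS * 0.5 * (mj + mk))
    return mass_diff * scalar_norm / (mj - mk)


# Deviation from unity at two lattice spacings (illustrative numbers).
dev_coarse = form_invariance_ratio(0.02, 0.05, 0.10, 0.10, 1.3, -0.6, 0.9) - 1.0
dev_fine = form_invariance_ratio(0.02, 0.05, 0.10, 0.05, 1.3, -0.6, 0.9) - 1.0
```

With these assumptions the ratio equals $1 - x^2$ with $x = a[b_m (m_j + m_k) + \bar b_m \mathrm{tr}M]$, so halving $a$ reduces the deviation by a factor of four, as expected for a pure $O(a^2)$ effect.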



C. Two-point axial Ward identities 

In this section we investigate which of the improvement and normalization coefficients 
can be determined using two-point Ward identities such as eq. Q. To obtain as much 
information as possible we need to vary the quark masses independently (as done in the 
quenched theory in Ref. joj) and consider the PCAC relation for both charged and neutral 
currents. 

We assume that $c_A$ has been determined, so that we know $A_\mu^{I(jk)}$. We can then calculate
the Ward identity mass,

$$
\tilde m_{jk} \equiv \frac{\langle \partial_\mu A_\mu^{I(jk)}(x)\, J^{(kj)}(0) \rangle}{2 \langle P^{(jk)}(x)\, J^{(kj)}(0) \rangle} \qquad (j \ne k) . \tag{47}
$$

Because we have improved the action and the axial current, $\tilde m_{jk}$ should be independent of
$x$ and of the source $J$ up to corrections of $O(a^2)$, and thus we do not give it any arguments,
nor specify the source. We imagine choosing a source with a good signal, varying the quark
masses (keeping $\tilde g_0^2$ fixed, as always) and studying the bare quark mass dependence of $\tilde m_{jk}$.
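Using the Ward identity $\partial_\mu \hat A_\mu^{(jk)} = (\hat m_j + \hat m_k) \hat P^{(jk)}$, we have

$$
\tilde m_{jk} = \frac{\hat m_j + \hat m_k}{2}\, \frac{Z_P (1 + a \bar b_P \mathrm{tr}M + a b_P m_{jk})}{Z_A (1 + a \bar b_A \mathrm{tr}M + a b_A m_{jk})} \tag{48}
$$
$$
 = \frac{Z_P Z_m}{Z_A} \left[ m_{jk} + \frac{\mathrm{tr}M}{3}(r_m - 1) + a \left( \mathcal{A}\, m_{jk}^2 + \mathcal{B}\, m_{jk} \mathrm{tr}M + \mathcal{C}\, (\mathrm{tr}M)^2 + \mathcal{D}\, \mathrm{tr}(M^2) \right) \right] , \tag{49}
$$

with $m_{jk} = (m_j + m_k)/2$ and

$$
\mathcal{A} = b_P - b_A - 2 b_m , \tag{50}
$$
$$
\mathcal{B} = \bar b_P - \bar b_A + \bar b_m + 2 b_m + \frac{r_m - 1}{3}(b_P - b_A) , \tag{51}
$$
$$
\mathcal{C} = \frac{r_m - 1}{3}(\bar b_P - \bar b_A) + \frac{2 r_m \bar d_m - 2 \bar b_m - 3 b_m}{6} , \tag{52}
$$
$$
\mathcal{D} = \frac{2 r_m d_m + b_m}{6} . \tag{53}
$$
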

From the terms linear in bare quark masses one can extract $Z_m Z_P/Z_A$ (as in the quenched
case) and $r_m$ (absent in the quenched case). For example, the derivative of $\tilde m_{jk}$ with respect
to $m$ at $M = 0$ for degenerate quarks is $r_m Z_m Z_P/Z_A$, while the derivative with respect
to $m_l$, $l \ne j, k$ alone is $(r_m - 1) Z_m Z_P/(3 Z_A)$ (a quantity which vanishes in the quenched
theory). Using the relations in Table I one has thus determined $Z_P/(Z_A Z_S)$ and $r_S = 1/r_m$.

From the quadratic terms one can determine the coefficient of each of the four linearly
independent functions of masses that appear. Thus we obtain four linear combinations of
the eight constants $b_A$, $\bar b_A$, $b_P$, $\bar b_P$, $b_m$, $\bar b_m$, $d_m$ and $\bar d_m$ (given that we know $r_m$ from above).
By comparison, in the quenched approximation, where there are only three constants, $b_A$,
$b_P$ and $b_m = d_m$, one can determine the two combinations $b_A - b_P$ and $b_m$ Qj.
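
The expansion of the Ward identity mass can be checked numerically. The sketch below assumes $N_f = 3$, the form of eq. (26) for $\hat m_j$, the ratio form $\tilde m_{jk} = \frac{1}{2}(\hat m_j + \hat m_k)\, Z_P(1 + a\bar b_P \mathrm{tr}M + a b_P m_{jk}) / [Z_A(1 + a\bar b_A \mathrm{tr}M + a b_A m_{jk})]$, and the combinations $\mathcal{A} = b_P - b_A - 2b_m$, $\mathcal{B} = \bar b_P - \bar b_A + \bar b_m + 2b_m + (r_m-1)(b_P - b_A)/3$, $\mathcal{C} = (r_m-1)(\bar b_P - \bar b_A)/3 + (r_m \bar d_m - \bar b_m)/3 - b_m/2$ and $\mathcal{D} = (2 r_m d_m + b_m)/6$. All numerical values and names are hypothetical.

```python
CONSTS = dict(Zm=1.2, ZP=0.8, ZA=0.9, rm=1.3,
              bm=-0.5, bbar_m=0.2, dm=-0.4, dbar_m=0.1,
              bP=1.1, bbar_P=0.3, bA=1.2, bbar_A=0.25)   # illustrative values


def improved_mass(j, M, c, a):
    """Assumed form of eq. (26) for N_f = 3."""
    t, t2, mj = sum(M), sum(m * m for m in M), M[j]
    order_a = (c["bm"] * mj**2 + c["bbar_m"] * mj * t
               + (c["rm"] * c["dm"] - c["bm"]) * t2 / 3.0
               + (c["rm"] * c["dbar_m"] - c["bbar_m"]) * t * t / 3.0)
    return c["Zm"] * (mj + (c["rm"] - 1.0) * t / 3.0 + a * order_a)


def wi_mass_ratio(j, k, M, c, a):
    """Ward identity mass from the unexpanded ratio form."""
    t, mjk = sum(M), 0.5 * (M[j] + M[k])
    num = c["ZP"] * (1.0 + a * c["bbar_P"] * t + a * c["bP"] * mjk)
    den = c["ZA"] * (1.0 + a * c["bbar_A"] * t + a * c["bA"] * mjk)
    return 0.5 * (improved_mass(j, M, c, a) + improved_mass(k, M, c, a)) * num / den


def wi_mass_expanded(j, k, M, c, a):
    """Ward identity mass from the expansion to first order in a."""
    t, t2, mjk = sum(M), sum(m * m for m in M), 0.5 * (M[j] + M[k])
    rm, bm = c["rm"], c["bm"]
    d_b, d_bbar = c["bP"] - c["bA"], c["bbar_P"] - c["bbar_A"]
    A = d_b - 2.0 * bm
    B = d_bbar + c["bbar_m"] + 2.0 * bm + (rm - 1.0) * d_b / 3.0
    C = (rm - 1.0) * d_bbar / 3.0 + (rm * c["dbar_m"] - c["bbar_m"]) / 3.0 - bm / 2.0
    D = (2.0 * rm * c["dm"] + bm) / 6.0
    quad = A * mjk**2 + B * mjk * t + C * t * t + D * t2
    return c["ZP"] * c["Zm"] / c["ZA"] * (mjk + (rm - 1.0) * t / 3.0 + a * quad)


masses = (0.02, 0.03, 0.08)
difference = wi_mass_ratio(0, 1, masses, CONSTS, 1e-4) - wi_mass_expanded(0, 1, masses, CONSTS, 1e-4)
```

The two expressions agree up to terms of $O(a^2)$. The rewriting relies on the three-flavor identity $(m_j^2 + m_k^2)/2 = -2 m_{jk}^2 + 2 m_{jk} \mathrm{tr}M + \mathrm{tr}(M^2)/2 - (\mathrm{tr}M)^2/2$, which is why only four mass functions appear.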

To obtain further information, we generalize the method by considering flavor-diagonal
two-point Ward identities, e.g.

$$
\left\langle \partial_\mu \left( \hat A_\mu^{(11)}(x) - \hat A_\mu^{(22)}(x) \right) J \right\rangle
 = \left\langle \left( 2 \hat m_1 \hat P^{(11)}(x) - 2 \hat m_2 \hat P^{(22)}(x) \right) J \right\rangle + O(a^2) . \tag{54}
$$

This introduces several of the new constants present in the unquenched theory: $f_A$ on the
left-hand side, and $f_P$, $d_P$ and $\bar d_P$ on the right-hand side (which contains the flavor-singlet $P$).
This corresponds to the fact that, for $m_1 \ne m_2$, there are contractions in which the source $J$
and the axial current (or pseudoscalar density) are not connected by quark propagators. In
fact, it is only for $m_1 \ne m_2$ that this identity gives new information: for degenerate quarks,
and with appropriate sources, the contractions are exactly the same as those for the flavor
off-diagonal identity.

To enforce the Ward identity, we have to adjust the constants so that the LHS and RHS
are equal for all choices of $x$ and sources $J$, up to $O(a^2)$. We cannot simply take their ratio,
as we did for the off-diagonal Ward identity, since more than one operator contributes on
both sides of the equation, with relative strengths that we do not know a priori. To proceed,
it is useful to expand out the operator appearing in the left-hand side of eq. (54):



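$$
\mathrm{LHS} = Z_A \left[ 1 + a \frac{b_A + 3 \bar b_A}{3} \mathrm{tr}M + a \frac{b_A}{6} \mathrm{tr}(\tilde\lambda_8 M) \right] \left\{ \partial_\mu \mathrm{tr}(\lambda_3 A_\mu)^I + a\, \mathcal{E}\, \mathrm{tr}(\lambda_3 M) \right\} , \tag{55}
$$
$$
\mathcal{E} = \frac{b_A}{6}\, \partial_\mu \mathrm{tr}(\tilde\lambda_8 A_\mu) + \frac{b_A + 3 f_A}{3}\, \partial_\mu \mathrm{tr}A_\mu . \tag{56}
$$

For convenience we have defined $\tilde\lambda_8 = \sqrt{3} \lambda_8$. We can now divide the Ward identity
by the overall function of the masses multiplying $\partial_\mu \mathrm{tr}(\lambda_3 A_\mu)^I$, so that the operator on the
left-hand side becomes

$$
\mathrm{LHS}' = \frac{\mathrm{LHS}}{Z_A \left[ 1 + a \frac{b_A + 3 \bar b_A}{3} \mathrm{tr}M + a \frac{b_A}{6} \mathrm{tr}(\tilde\lambda_8 M) \right]} = \partial_\mu \mathrm{tr}(\lambda_3 A_\mu)^I + a\, \mathcal{E}\, \mathrm{tr}(\lambda_3 M) . \tag{57}
$$

Since we have previously determined the improvement coefficients in the leading order
operator in this equation, $\partial_\mu \mathrm{tr}(\lambda_3 A_\mu)^I$, the coefficients of all other, independent, operators
appearing on both sides of the rescaled Ward identity can be determined. In particular, we
immediately see that $b_A + 3 f_A$ and $b_A$ can be determined since they multiply independent
operators in $\mathcal{E}$.

We now divide the right-hand side by the same factor, and split the operator which results
into terms linear and quadratic in quark masses:

$$
\mathrm{RHS}' = \frac{\mathrm{RHS}}{Z_A \left[ 1 + a \frac{b_A + 3 \bar b_A}{3} \mathrm{tr}M + a \frac{b_A}{6} \mathrm{tr}(\tilde\lambda_8 M) \right]} = \mathrm{RHS}_I + \mathrm{RHS}_{II} . \tag{58}
$$
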

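We find

$$
\mathrm{RHS}_I = \frac{Z_m Z_P}{3 Z_A} \left\{ \mathrm{tr}(\lambda_3 P) \left[ 2 r_m \mathrm{tr}M + \mathrm{tr}(\tilde\lambda_8 M) \right]
 + \mathrm{tr}(\lambda_3 M) \left[ 2 r_P \mathrm{tr}P + \mathrm{tr}(\tilde\lambda_8 P) + 2 a r_P g_P \mathrm{Tr}(F_{\mu\nu} \tilde F_{\mu\nu}) \right] \right\} . \tag{59}
$$

We can determine the coefficients of each independent function of masses (of which there
are three at this order) multiplying each independent operator. Thus we can determine
$Z_m Z_P/Z_A$ and $r_m$, again, as well as $r_P$ and $g_P$ for the first time.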

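The quadratic terms are more complicated. There are six independent functions of the
masses, each potentially multiplying three independent operators, although the $1 \leftrightarrow 2$
antisymmetry allows only eight independent products. We find

$$
\mathrm{RHS}_{II} = a \frac{Z_m Z_P}{9 Z_A} \Big\{ \left[ \mathcal{F}\, \mathrm{tr}(M)^2 + \mathcal{G}\, \mathrm{tr}(\lambda_3 M)^2 + \mathcal{H}\, \mathrm{tr}(M) \mathrm{tr}(\tilde\lambda_8 M) + \mathcal{I}\, \mathrm{tr}(M^2) \right] \mathrm{tr}(\lambda_3 P)
$$
$$
 + \left[ \mathcal{J}\, \mathrm{tr}(\lambda_3 M) \mathrm{tr}(\tilde\lambda_8 M) + \mathcal{K}\, \mathrm{tr}(\lambda_3 M) \mathrm{tr}M \right] \mathrm{tr}P
 + \left[ \mathcal{L}\, \mathrm{tr}(\lambda_3 M) \mathrm{tr}(\tilde\lambda_8 M) + \mathcal{M}\, \mathrm{tr}(\lambda_3 M) \mathrm{tr}M \right] \mathrm{tr}(\tilde\lambda_8 P) \Big\} , \tag{60}
$$
$$
\mathcal{F} = (2 r_m - 1)(b_P - b_A) + 6 r_m (\bar b_P - \bar b_A + \bar d_m) + b_m = \mathcal{A} + 6 \mathcal{B} + 18 \mathcal{C} , \tag{61}
$$
$$
\mathcal{G} = \frac{3}{2} (b_A + 2 b_m + 2 r_P d_P) , \tag{62}
$$
$$
\mathcal{H} = 3 (\bar b_P + \bar b_m - \bar b_A) + (r_m + 1)(b_P - b_A) + 2 b_m = 2 \mathcal{A} + 3 \mathcal{B} , \tag{63}
$$
$$
\mathcal{I} = 3 (b_P - b_A + 2 r_m d_m - b_m) = 3 \mathcal{A} + 18 \mathcal{D} , \tag{64}
$$
$$
\mathcal{J} = 2 b_P + 6 f_P + 2 r_P b_m - r_P b_A , \tag{65}
$$
$$
\mathcal{K} = 6 r_P (\bar b_m + \bar d_P - \bar b_A) + 2 r_m (b_P + 3 f_P) + 2 r_P (2 b_m + d_P - b_A) , \tag{66}
$$
$$
\mathcal{L} = b_m + r_P d_P - b_A/2 = \mathcal{G}/3 - b_A , \tag{67}
$$
$$
\mathcal{M} = 3 (\bar b_P + \bar b_m - \bar b_A) + (b_P - b_A + 2 b_m) + r_m b_P = \mathcal{H} + r_m b_A . \tag{68}
$$

Thus one can determine, in principle, the eight combinations $\mathcal{F}$-$\mathcal{M}$. As indicated, however,
only three of these are independent of the combinations $\mathcal{A}$-$\mathcal{D}$ that one can obtain
using the flavor off-diagonal two-point Ward identities. Thus one can only determine seven
combinations of the ten improvement constants which enter (we exclude $b_A$ and $f_A$ since we
have determined these from the left-hand side of the present identity). A particular choice
of these seven combinations is listed in Table II. To further disentangle the coefficients
requires three-point Ward identities, which we consider below.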

As noted above, implementation of the flavor-diagonal Ward identity necessarily involves
quark-disconnected contractions and thus will be numerically challenging. Thus it is
interesting to know how many combinations of quark masses are needed. Are simulations in a
2+1 flavor theory sufficient, or do all three quarks need to be non-degenerate? It turns out
that a combination of simulations with degenerate masses (taking three or more values) and
2+1 simulations (with at least two values of the light quark mass differing from each of
two choices for the strange quark mass) is sufficient. This allows one to disentangle all the
different linear and quadratic mass dependences that appear.
D. Three-point axial Ward identities

We now turn to the enforcement of the chiral transformation properties of bilinear operators.
The methodology is standard [22]: what we add here is the generalization to
non-degenerate masses in the unquenched theory, and the use of identities involving
flavor-singlet components. Contact terms limit what can be extracted with this method, and will
need to be understood in detail.

We begin with the simplest example, which is that considered in previous calculations.
A non-singlet axial transformation with flavor $(ij)$ is performed in a region of space-time $\mathcal{V}$
which includes a bilinear $O^{(jk)}$ with $k \ne i$. If $O^{(jk)} = \bar\psi^j \Gamma \psi^k$, then it is transformed into
$\delta O^{(ik)} = \bar\psi^i \gamma_5 \Gamma \psi^k$. The identity we enforce on the lattice is thus

$$
\left\langle (\delta_A^{(ij)} \mathcal{S})\, \hat O^{(jk)}(y)\, J^{(ki)}(z) \right\rangle = \left\langle \delta \hat O^{(ik)}(y)\, J^{(ki)}(z) \right\rangle + O(a^2) , \tag{69}
$$

where $\delta_A \mathcal{S}$ is the improved and normalized lattice form of the formal variation of the
continuum action

$$
(\delta_A^{(ij)} \mathcal{S}) = a^4 \sum_{\mathcal{V}} \left[ (\hat m_i + \hat m_j) \hat P^{(ij)} - \partial_\mu \hat A_\mu^{(ij)} \right] , \tag{70}
$$

with $\mathcal{V}$ a 4-dimensional subset of the lattice containing $y$ but not the source at $z$. (We
do not place a "hat" on $\delta_A \mathcal{S}$ so as to avoid overloading the notation.) Note that only
quark-connected contractions contribute to this Ward identity.

Actually, as is well known, the identity (69) cannot be satisfied simply by on-shell
improvement, since the pseudoscalar density appearing in $(\delta_A^{(ij)} \mathcal{S})$ comes into contact with
$\hat O^{(jk)}$. Additional off-shell improvement terms are needed, having the same quantum
numbers as the product $P^{(ij)} O^{(jk)}$, and having the appropriate dimension. Since there is an
explicit factor of $\hat m_i + \hat m_j$, the only such term with the right dimension and symmetries
is $\delta O^{(ik)}$. In previous work, we have used a mnemonic for obtaining this contact term,
namely that we can off-shell improve the bilinears by introducing an additional
"equations-of-motion" operator [7]:

$$
a c'_O \left[ \bar\psi^j \left( -\overleftarrow{\slashed{D}} + m_j \right) \Gamma \psi^k + \bar\psi^j \Gamma \left( \overrightarrow{\slashed{D}} + m_k \right) \psi^k \right] . \tag{71}
$$

While adequate for discussing the Ward identities (69), this is potentially misleading for
two reasons. First, the form implies that contact terms between two operators can be
factorized into the contribution of one operator times that of the other. In fact, there is no
such factorization of contact terms. Second, for the operators we consider below, there are
several possible contact terms and the mnemonic cannot be easily generalized. Thus we do
not use this mnemonic further in this paper. In fact, all we need to know is that there exist
possible contact terms involving the operator $\delta O^{(ik)}$ and multiplied by $\hat m_i + \hat m_j \propto \tilde m_{ij} + O(a)$.

A convenient way of using eq. (69) is to take the ratio of the two sides having pulled out
unknown mass factors. We first define the improved (but not normalized) variation of the
action by

$$
(\delta_A^{(ij)} \mathcal{S})^I \equiv \frac{(\delta_A^{(ij)} \mathcal{S})}{Z_A (1 + a \bar b_A \mathrm{tr}M + a b_A m_{ij})} = a^4 \sum_{\mathcal{V}} \left[ 2 \tilde m_{ij} P^{(ij)} - \partial_\mu A_\mu^{I(ij)} \right] . \tag{72}
$$

This can be determined by calculating $\tilde m_{ij}$ using eq. (47), since by assumption we know $A_\mu^I$.
The ratio we consider is then

$$
R_O \equiv \frac{\left\langle (\delta_A^{(ij)} \mathcal{S})^I\, O^{I(jk)}(y)\, J^{(ki)}(z) \right\rangle}{\left\langle \delta O^{I(ik)}(y)\, J^{(ki)}(z) \right\rangle} \tag{73}
$$
$$
 = \frac{Z_{\delta O}}{Z_A Z_O} \left[ 1 + a (\bar b_{\delta O} - \bar b_O - \bar b_A) \mathrm{tr}M + a (b_{\delta O} m_{ik} - b_O m_{jk} - b_A m_{ij}) \right] + a c'_O \tilde m_{ij} + O(a^2) . \tag{74}
$$

Here the contact term is included with an unknown coefficient $c'_O$.^ Requiring $R_O$ to be
independent of $y$ and $J$ in the chiral limit determines $c_V$ and $c_T$, and we assume this has
been done, so that we know all the $O^{I(jk)}$. One also obtains information on the $Z_O$, as first
noted in Ref. [2^. Away from the chiral limit $R_O$ should be automatically independent of
$y$ and $J$ up to $O(a^2)$, since there are no additional operators with coefficients to tune. Note
that the contact term plays no role in this regard. It has the same operator $\delta O$ present in
both numerator and denominator, and so is independent of $y$ and $J$ by itself.

Thus, for convenient choices of $y$ and $J$, evaluating $R_O$ away from the chiral limit allows
one to determine one combination of improvement coefficients for each of the three
independent linear functions of the quark masses, except that one of these is not useful as it is
"contaminated" by the contact term. One complication compared to the quenched case is
that the mass dependence of the contact term, $\tilde m_{ij}$, is not simply proportional to $m_{ij}$, but
has an additional part proportional to $(r_m - 1) \mathrm{tr}M$, as can be seen from eq. (49). Because

Our convention for $c'_O$ differs from that defined in Ref. [3].
of this, a convenient choice of basis is $\tilde m_{ij}$, $m_i - m_j$ and $\mathrm{tr}M$, using which we find

$$
R_O = \frac{Z_{\delta O}}{Z_A Z_O} \Big\{ 1 + a \left[ \bar b_{\delta O} - \bar b_O - \bar b_A + (b_{\delta O} - b_O)(2 + r_m)/6 + b_A (r_m - 1)/3 \right] \mathrm{tr}M
 + a (b_{\delta O} + b_O)(m_i - m_j)/4 \Big\}
$$
$$
 + a \left[ c'_O - (b_{\delta O} - b_O + 2 b_A) \frac{Z_{\delta O}}{2 Z_O Z_P Z_m} \right] \tilde m_{ij} + O(a^2) . \tag{75}
$$

The combinations of the improvement coefficients multiplying $\mathrm{tr}M$ and $m_i - m_j$ in this
equation can, in principle, be determined. To do so it is sufficient to use a 2 + 1 flavor
theory: the coefficient of $\mathrm{tr}M$ can be determined using $m_i = m_j \ne m_k$, and that of $m_i - m_j$
can then be determined using $m_i \ne m_j = m_k$. To simplify the discussion below, we note
that a quick way of determining the accessible combinations of improvement coefficients is
to set $\tilde m_{ij} = 0$, so that one can make the substitution $m_{ij} = -\mathrm{tr}M (r_m - 1)/3$.^ The resulting
coefficients of $m_i - m_j$ and $\mathrm{tr}M$ are those that can be determined.
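
The change of mass basis can be verified with a few lines of arithmetic. The sketch below assumes $N_f = 3$ with $k$ the third flavor, so that $m_{ik} = \mathrm{tr}M/2 - m_{ij}/2 + (m_i - m_j)/4$ and $m_{jk} = \mathrm{tr}M/2 - m_{ij}/2 - (m_i - m_j)/4$, and uses $x = m_{ij} + (r_m - 1)\mathrm{tr}M/3$ as the leading-order stand-in for $\tilde m_{ij}$ (overall $Z$ factors dropped). The names and coefficient values are hypothetical.

```python
def bracket_original(mi, mj, mk, b, bbar):
    """O(a) mass dependence written with tr M, m_ik, m_jk and m_ij.

    b and bbar map the labels "dO", "O", "A" to the unbarred and barred
    improvement coefficients (illustrative values only).
    """
    t = mi + mj + mk
    return ((bbar["dO"] - bbar["O"] - bbar["A"]) * t
            + b["dO"] * 0.5 * (mi + mk)
            - b["O"] * 0.5 * (mj + mk)
            - b["A"] * 0.5 * (mi + mj))


def bracket_rebased(mi, mj, mk, b, bbar, rm):
    """The same quantity in the basis {tr M, m_i - m_j, x}."""
    t = mi + mj + mk
    x = 0.5 * (mi + mj) + (rm - 1.0) * t / 3.0      # vanishes when mtilde_ij = 0 at LO
    coeff_trm = (bbar["dO"] - bbar["O"] - bbar["A"]
                 + (b["dO"] - b["O"]) * (2.0 + rm) / 6.0
                 + b["A"] * (rm - 1.0) / 3.0)
    coeff_diff = (b["dO"] + b["O"]) / 4.0
    coeff_x = -(b["dO"] - b["O"] + 2.0 * b["A"]) / 2.0
    return coeff_trm * t + coeff_diff * (mi - mj) + coeff_x * x


b = {"dO": 1.4, "O": 1.1, "A": 1.2}
bbar = {"dO": 0.3, "O": 0.1, "A": 0.25}
gap = bracket_original(0.02, 0.05, 0.09, b, bbar) - bracket_rebased(0.02, 0.05, 0.09, b, bbar, rm=1.3)
```

Only the coefficients of $\mathrm{tr}M$ and $m_i - m_j$ are free of the contact term, which multiplies the third basis element.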

Applying this method to the bilinears in turn, we obtain the results given in Table II.
These allow the determination of four new quantities: $Z_A$, $Z_S/Z_P$, $b_T$ and $\bar b_A$ (where for the
latter we have used knowledge of $b_A$ and $r_m$ from the two-point Ward identities). The
combinations of $S$ and $P$ improvement coefficients that are obtained, however, are all related to
those obtained from the two-point Ward identities using the relations listed in Table I. Thus
we obtain a check of these relations, but no new information on the constants themselves.

To determine further improvement coefficients we consider axial Ward identities involving
the transformation of flavor diagonal bilinears. These have not been considered previously,
and, in particular, they are not needed in the quenched approximation. They involve
quark-disconnected contractions in an essential way. The analysis is simplified by the observation
that the Ward identities considered above allow the complete determination of $\hat A_\mu^{(ij)}$ and
thus of the improved variation of the action, $(\delta_A^{(ij)} \mathcal{S})$, including its normalization. Thus the
only unknown coefficients appearing in the three-point axial Ward identities we consider
below are those in the operator $O$ and its axial variation $\delta O$. We will also make use of the
previous determination of $r_m$.

Note that this is a theoretical device and not a practical tool. Setting $\tilde m_{12} = 0$ and considering $m_1 - m_2 \ne 0$
implies that some quark masses are negative. This is undesirable in practice due to the possible phase
structure at $m \sim 0$. In practice one would likely need to do a fit using positive quark masses in order to
separate the different mass dependencies.
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We first consider the axial rotation properties of tlie flavor-singlet operators. The singlet 
axial current should be invariant under non-singlet axial transformations, so we enforce 

((<5f ^5) \rA,{y)J^''\z)) = 0{a') , [j ^ k] . (76) 

We choose flavor indices 1 and 2 so that we can use the standard Gell-Mann basis of SU (3) 
matrices — permutations of indices are, of course, allowed. Since the right-hand side of 
eq. ()7fij) vanishes (at the order we are working), this relation can only determine relative nor- 
malizations between independent operators appearing on the left-hand- side. Using eqs. f|Tn|l 
and (123), one finds 

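$$\mathrm{tr}\hat A_\mu \propto \mathrm{tr}A_\mu + a\bar c_A\,\partial_\mu \mathrm{tr}P + a d_A\,\mathrm{tr}(M A_\mu)$$
$$\propto \mathrm{tr}A_\mu + a\bar c_A\,\partial_\mu \mathrm{tr}P + a(d_A/6)\left[3\,\mathrm{tr}(\lambda_3 M)\,\mathrm{tr}(\lambda_3 A_\mu) + \mathrm{tr}(\bar\lambda_8 M)\,\mathrm{tr}(\bar\lambda_8 A_\mu)\right]. \qquad (77)$$

In the second line we have used

$$\mathrm{tr}(MO) = \tfrac{1}{3}\,\mathrm{tr}(M)\,\mathrm{tr}(O) + \tfrac{1}{2}\,\mathrm{tr}(\lambda_3 M)\,\mathrm{tr}(\lambda_3 O) + \tfrac{1}{6}\,\mathrm{tr}(\bar\lambda_8 M)\,\mathrm{tr}(\bar\lambda_8 O), \qquad (78)$$

valid for diagonal mass matrices (with $\bar\lambda_8 \equiv \sqrt{3}\,\lambda_8 = \mathrm{diag}(1,1,-2)$), to express the result in terms of independent operators,
and absorbed the contribution proportional to $\mathrm{tr}M\,\mathrm{tr}A_\mu$ into the overall constant. In the
chiral limit only the first two terms in (77) are present and so $\bar c_A$ can be determined. Away
from the chiral limit, we must avoid contact terms, which are proportional to $\tilde m_{12}$. Following
the discussion above, we do so by setting $\tilde m_{12} = 0$, leading to $\mathrm{tr}(\bar\lambda_8 M) = -2 r_m \mathrm{tr}M$, so that
eq. (77) becomes

$$\mathrm{tr}\hat A_\mu \Big|_{\tilde m_{12}=0} \propto \mathrm{tr}A_\mu + a\bar c_A\,\partial_\mu \mathrm{tr}P + a(d_A/6)\left[3\,\mathrm{tr}(\lambda_3 M)\,\mathrm{tr}(\lambda_3 A_\mu) - 2 r_m\,\mathrm{tr}M\,\mathrm{tr}(\bar\lambda_8 A_\mu)\right]. \qquad (79)$$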



Thus $d_A$ can be determined by tuning the coefficients of either $\mathrm{tr}(\lambda_3 A_\mu)$ or (assuming $r_m$ is
known) $\mathrm{tr}(\bar\lambda_8 A_\mu)$. Both require non-degenerate quarks (degenerate quarks are not sufficient
since the constraint $\tilde m_{12} = 0$ then implies $\mathrm{tr}(\lambda_3 M) = \mathrm{tr}M = 0$), but 2 + 1 flavors suffice
($m_1 = m_2 \neq m_3$ to determine the coefficient of $\mathrm{tr}(\bar\lambda_8 A_\mu)$, and $m_1 \neq m_2 = m_3$ for that of
$\mathrm{tr}(\lambda_3 A_\mu)$). The net result is the first determination of both $\bar c_A$ and $d_A$.
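
The trace decomposition (78) and the consequence of the constraint $\tilde m_{12} = 0$ can be checked with exact arithmetic. The following is only a minimal sketch, not part of the original analysis. It assumes the rescaled matrix $\bar\lambda_8 = \mathrm{diag}(1,1,-2)$, and it takes $\tilde m_j = m_j + (r_m - 1)\,\mathrm{tr}M/3$ as the definition of the shifted masses; that definition is an assumption on our part, adopted because it reproduces $\mathrm{tr}(\bar\lambda_8 M) = -2 r_m \mathrm{tr}M$. All function names and numerical inputs are illustrative.

```python
from fractions import Fraction as F

LAMBDA3 = (1, -1, 0)       # diagonal of lambda_3
LAMBDA8_BAR = (1, 1, -2)   # assumed rescaled lambda_8 (sqrt(3) times the Gell-Mann matrix)
ONE = (1, 1, 1)


def tr_diag(lam, diag):
    """tr(lam X) for a diagonal matrix lam, given the diagonal entries of X."""
    return sum(l * x for l, x in zip(lam, diag))


def decompose(masses, op_diag):
    """Right-hand side of eq. (78) for diagonal M; only the diagonal of O contributes."""
    return (F(1, 3) * tr_diag(ONE, masses) * tr_diag(ONE, op_diag)
            + F(1, 2) * tr_diag(LAMBDA3, masses) * tr_diag(LAMBDA3, op_diag)
            + F(1, 6) * tr_diag(LAMBDA8_BAR, masses) * tr_diag(LAMBDA8_BAR, op_diag))


def masses_with_m12_tilde_zero(m1, m3, r_m):
    """Choose m2 so that m~_1 + m~_2 = 0, assuming m~_j = m_j + (r_m - 1) trM / 3."""
    # m1 + m2 + 2 (r_m - 1) (m1 + m2 + m3) / 3 = 0, solved for the sum m1 + m2
    s12 = -2 * (r_m - 1) * m3 / (2 * r_m + 1)
    return (m1, s12 - m1, m3)


# hypothetical inputs
masses = (F(1, 10), F(-3, 7), F(2, 5))
op_diag = (F(2), F(-5, 3), F(7, 4))
direct = sum(m * o for m, o in zip(masses, op_diag))   # tr(M O) computed directly

r_m = F(3, 2)
constrained = masses_with_m12_tilde_zero(F(1, 20), F(1, 5), r_m)
```

With the constraint imposed, `tr_diag(LAMBDA8_BAR, constrained)` equals `-2 * r_m * sum(constrained)`, which is the replacement used in passing from (77) to (79).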

An almost identical discussion holds for the flavor-singlet vector bilinear, the conclusion
from which is that one can determine $\bar c_V$ (for the first time) and $d_V$ (which checks the
determination from the vector charge).
For the other three bilinears the Ward identities are different, since the singlet bilinears
are not invariant:

$$\langle(\delta_A^{(jk)} S)\,\mathrm{tr}\hat P(y)\,J^{(kj)}(z)\rangle = 2\langle \hat S^{(jk)}(y)\,J^{(kj)}(z)\rangle + O(a^2), \qquad (80)$$

$$\langle(\delta_A^{(jk)} S)\,\mathrm{tr}\hat T_{\mu\nu}(y)\,J^{(kj)}(z)\rangle = -\epsilon_{\mu\nu\rho\sigma}\langle \hat T^{(jk)}_{\rho\sigma}(y)\,J^{(kj)}(z)\rangle + O(a^2), \qquad (81)$$

$$\langle(\delta_A^{(jk)} S)\,\mathrm{tr}\hat S(y)\,J^{(kj)}(z)\rangle = 2\langle \hat P^{(jk)}(y)\,J^{(kj)}(z)\rangle + O(a^2). \qquad (82)$$

The general strategy to enforce these relations is to take the ratio of the two sides, and require
that the result is unity independent of $y$ and $J$. This should also be true independent of
quark masses, as long as one avoids contact terms by keeping $\tilde m_{jk} = 0$.

Consider first the transformation of the singlet pseudoscalar, (80), and set $j = 1$, $k = 2$
for convenience. If we work at $\tilde m_{12} = 0$ to avoid contact terms, we find

$$\mathrm{tr}\hat P = Z_P r_P\left[1 + a(\bar d_P + d_P/3)\,\mathrm{tr}M\right]\Big\{\mathrm{tr}P + a g_P\,\mathrm{Tr}(F_{\mu\nu}\tilde F_{\mu\nu}) + a d_P\left[\mathrm{tr}(\lambda_3 M)\,\mathrm{tr}(\lambda_3 P)/2 - r_m\,\mathrm{tr}M\,\mathrm{tr}(\bar\lambda_8 P)/3\right]\Big\} + O(a^2). \qquad (83)$$

This implies that we can determine $g_P$ (in the chiral limit) and $d_P$ (in two independent ways
as for $d_A$ above, both requiring only 2 + 1 flavor simulations). Bringing the overall factor to
the RHS of (80) we obtain (again, with $\tilde m_{12} = 0$):

C(12) 

(84) 

Thus we can determine $Z_S/(Z_P r_P)$ (which serves as a check) and the combination multiplying
$\mathrm{tr}M$ (the latter again requiring only 2 + 1 flavors).

The Ward identity just discussed gives the first determination of $d_P$. This then allows
the linear combinations of constants for the masses and pseudoscalar bilinears determined
previously (and given in the second section of Table II) to be simplified. In particular, we can
now separately determine $b_P$, $f_P$, $b_m$ and $d_m$ as well as $d_P$. This leaves three combinations
that cannot yet be disentangled: $\bar b_m + \bar d_P$, $\bar b_P - \bar d_P$, and $\bar b_m - \bar d_m$. Note that, at this stage, the
combination multiplying $\mathrm{tr}M$ in eq. (84) does not determine any further coefficients since
$\bar b_S - \bar d_P = -(\bar b_m + \bar d_P)$.

The same analysis goes through for the tensor bilinear, leading to the first determination
of $r_T$, $\bar c_T$, $d_T$ and $\bar b_T - \bar d_T - (r_m - 1) b_T/3$. Since we know $b_T$ from previous Ward identities,
we can extract $\bar b_T - \bar d_T$.
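
The $(r_m - 1)/3$ shift in combinations such as the one above comes from imposing $\tilde m_{jk} = 0$ in the mass-dependent normalization factor $1 + a\,\bar b_O\,\mathrm{tr}M + a\,b_O\,m_{jk}$ of an off-diagonal bilinear. The substitution can be sketched as follows. This is a hedged illustration only: it assumes $\tilde m_{jk} = m_{jk} + (r_m - 1)\,\mathrm{tr}M/N_f$ with $m_{jk} = (m_j + m_k)/2$, which is our inference rather than a formula quoted in this section, and all numerical values are hypothetical.

```python
from fractions import Fraction as F


def mass_factor_general(b_bar, b, tr_m, m_jk, a):
    """1 + a*b_bar*trM + a*b*m_jk for a flavor off-diagonal bilinear."""
    return 1 + a * b_bar * tr_m + a * b * m_jk


def mass_factor_at_zero_mtilde(b_bar, b, tr_m, r_m, a, n_f=3):
    """The same factor after imposing m~_jk = 0, i.e. m_jk = -(r_m - 1) trM / N_f."""
    return 1 + a * tr_m * (b_bar - (r_m - 1) * b / n_f)


# hypothetical inputs
a, r_m, tr_m = F(1, 10), F(5, 4), F(3, 50)
b_bar, b = F(1, 7), F(9, 8)

m_jk_three_flavors = -(r_m - 1) * tr_m / 3
m_jk_four_flavors = -(r_m - 1) * tr_m / 4
```

The `n_f` argument reflects the way the same combinations appear with a factor $1/4$ in the four-flavor results of Table III.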
The analysis for the scalar bilinear is more subtle, due to the presence of the identity
operator in $\mathrm{tr}\hat S$. The net result, however, is as if this identity component was absent, and we
find that one can determine the same list as for the pseudoscalar after permuting $P \leftrightarrow S$:
$g_S$, $d_S$, $Z_P/(Z_S r_S)$ and (using the previously determined $b_P$) $\bar b_P - \bar d_S$. Of these, only $d_S$
is new. Using the relation $d_S = b_S + 3\bar b_S$ from Table I and the previously determined
$b_S = -2b_m$, we can extract $\bar b_S = -\bar b_m$. This allows one to disentangle the remaining linear
combinations of $M$ and $P$ improvement coefficients, so that we can determine $\bar b_m$, $\bar d_m$, $\bar b_P$
and $\bar d_P$ separately.
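
The disentangling step described above is elementary linear algebra. As an illustrative sketch (the function name, the reading of which coefficients carry bars, and the numerical inputs are all assumptions made for the example), suppose the three combinations $\bar b_m + \bar d_P$, $\bar b_P - \bar d_P$ and $\bar b_m - \bar d_m$ are known, and $\bar b_m$ has just been obtained:

```python
from fractions import Fraction as F


def disentangle(bm_plus_dp, bp_minus_dp, bm_minus_dm, bm_bar):
    """Return (bm_bar, dm_bar, bp_bar, dp_bar) from three known combinations plus bm_bar."""
    dp_bar = bm_plus_dp - bm_bar      # from  bm_bar + dp_bar
    bp_bar = bp_minus_dp + dp_bar     # from  bp_bar - dp_bar
    dm_bar = bm_bar - bm_minus_dm     # from  bm_bar - dm_bar
    return bm_bar, dm_bar, bp_bar, dp_bar


# hypothetical "true" values used to build the known combinations
true_bm, true_dm, true_bp, true_dp = F(1, 3), F(-2, 5), F(7, 9), F(1, 8)
recovered = disentangle(true_bm + true_dp, true_bp - true_dp, true_bm - true_dm, true_bm)
```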

We now return to the identity operator contribution on the LHS of eq. (82). Formally,
this contribution vanishes since it is invariant under axial transformations. However, this
fails on the lattice because the variation of the identity operator does not vanish fast enough
to overcome the divergence in its coefficient:

(85)

To overcome this one can explicitly subtract the disconnected contribution

$$\langle \mathrm{tr}\hat S(y)\rangle \times \langle(\delta_A^{(jk)} S)\,J^{(kj)}(z)\rangle \qquad (86)$$

from the LHS of eq. (82). This is equivalent to enforcing the difference of two continuum
Ward identities, with coefficients chosen so as to completely remove the contribution from the
$e_S$ term, including its mass dependence. The cancellation between the two terms is between
power-divergent contributions, leaving a residue that must be accurate to the order at which we work. Thus
it will require good statistical control. On the other hand, the dominant contribution to
the two terms will be correlated (since it involves the identity operator), which will help the
cancellation.

In addition to allowing the separation of all the coefficients for $M$ and $P$, this Ward
identity gives a new method for calculating $b_g$, using the relation between $b_g$ and $g_S$ and the
determination of $g_S$. Note that this determination can be carried out in the chiral limit, and
so one does not need to know $b_g$ a priori. This makes the determination of improvement
coefficients using the Ward identities discussed here self-contained.

Using the relations in Table I, $\bar b_P - \bar d_S = (\bar b_P - \bar d_P) + (\bar d_P + \bar b_m) + 2(\bar d_m - \bar b_m) + 2d_m/3 - 2b_m/3$, and all
the coefficients appearing on the RHS are known.

Note that all three operators in $\mathrm{tr}\hat S$, i.e. $\mathrm{tr}S$, $\mathrm{Tr}(F_{\mu\nu}F_{\mu\nu})$, and $\mathrm{tr}(MS)$, lead to contributions to $\langle\mathrm{tr}\hat S(y)\rangle$
which are separately divergent, but each can be combined with its subtraction term and then multiplied
by the unknown coefficient to be determined.

This situation is similar to the subtraction of power-divergent mixing in weak matrix element calculations
in the quenched approximation, which also benefits from correlations between the two quantities being
subtracted, and has been successfully carried out in practice.



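The final Ward identity giving new information is

$$\langle(\delta_A^{(jk)} S)\,\hat T^{(kj)}_{\mu\nu}(y)\,J(z)\rangle = -\tfrac{1}{2}\epsilon_{\mu\nu\rho\sigma}\left\langle\left[\hat T^{(jj)}_{\rho\sigma}(y) + \hat T^{(kk)}_{\rho\sigma}(y)\right]J(z)\right\rangle + O(a^2). \qquad (87)$$

In particular, the RHS contains the diagonal non-singlet, and thus provides access to $f_T$ for
the first time. To see what we learn from enforcing this identity, we divide through by

$$Z_T\left[1 + a\,\mathrm{tr}M\,\bar b_T + a\,m_{jk}\,b_T\right] \;\xrightarrow{\;\tilde m_{jk}=0\;}\; Z_T\left[1 + a\,\mathrm{tr}M\left(\bar b_T - \frac{r_m - 1}{3}\,b_T\right)\right]. \qquad (88)$$

Then we know the quantities on the LHS of the Ward identity. The operator on the RHS
becomes (again setting $j = 1$ and $k = 2$, and still working at $\tilde m_{12} = 0$)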



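$$\frac{\hat T^{(11)}_{\rho\sigma} + \hat T^{(22)}_{\rho\sigma}}{Z_T\left[1 + a\,\mathrm{tr}M\left(\bar b_T - \frac{r_m-1}{3}\,b_T\right)\right]} = \left(\frac{2r_T}{3} + \mathcal{N}a\,\mathrm{tr}M\right)\mathrm{tr}T_{\rho\sigma} + \left(\frac{1}{3} + \mathcal{P}a\,\mathrm{tr}M\right)\mathrm{tr}(\bar\lambda_8 T_{\rho\sigma}) + \mathcal{Q}a\,\mathrm{tr}(\lambda_3 M)\,\mathrm{tr}(\lambda_3 T_{\rho\sigma}), \qquad (89)$$

where the operators on the right-hand side are understood to include their $O(a)$ derivative improvement terms (with coefficients $\bar c_T$ and $c_T$), and

$$\mathcal{N} = \frac{2}{3}\left[r_T\left(\bar d_T - \bar b_T + \frac{d_T}{3} + \frac{r_m - 1}{3}\,b_T\right) - r_m\left(f_T + \frac{b_T}{3}\right)\right], \qquad (90)$$

$$\mathcal{P} = \frac{2}{9}\,r_m\,(b_T - r_T d_T), \qquad (91)$$

$$\mathcal{Q} = \frac{1}{6}\,(b_T + 2 r_T d_T). \qquad (92)$$

Thus in the chiral limit we can determine $r_T$, $\bar c_T$ and $c_T$, which provide cross-checks, while
away from the chiral limit we obtain the three combinations $\mathcal{N}$, $\mathcal{P}$ and $\mathcal{Q}$ (all obtainable
with 2 + 1 flavors). These in turn can be combined to give $b_T$ and $d_T$ separately, as well as
the quantity $r_T(\bar d_T - \bar b_T) - r_m f_T$. Given the determination of $\bar b_T - \bar d_T$ from the Ward identity
(81) above, we can extract $f_T$. We cannot, however, see any way of disentangling $\bar d_T$ and
$\bar b_T$.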
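
The appearance of both a singlet and a diagonal non-singlet component on the RHS rests on the flavor decomposition $O^{(11)} + O^{(22)} = \tfrac{2}{3}\,\mathrm{tr}O + \tfrac{1}{3}\,\mathrm{tr}(\bar\lambda_8 O)$. A minimal check of this decomposition is sketched below, assuming the rescaled matrix $\bar\lambda_8 = \mathrm{diag}(1,1,-2)$; the function name and the input values are illustrative only.

```python
from fractions import Fraction as F


def singlet_octet_split(o11, o22, o33):
    """Split O^(11) + O^(22) into its singlet and lambda_8-bar components.

    Uses tr O = o11 + o22 + o33 and tr(lambda8_bar O) = o11 + o22 - 2*o33,
    with lambda8_bar = diag(1, 1, -2) (assumed normalization).
    """
    tr_o = o11 + o22 + o33
    tr_l8 = o11 + o22 - 2 * o33
    return F(2, 3) * tr_o, F(1, 3) * tr_l8


# hypothetical diagonal entries of a flavor matrix of bilinears
o11, o22, o33 = F(3), F(-1, 2), F(7, 5)
singlet_part, octet_part = singlet_octet_split(o11, o22, o33)
```

The third diagonal entry drops out of the sum of the two parts, as it must for an operator built only from flavors 1 and 2.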

Although we have avoided contact terms by working at $\tilde m_{12} = 0$, we note that these
terms are more complicated here than in the previous Ward identity. The contact terms
arise from the fact that, even if one has on-shell improved the operator $\hat P^{(jk)}$ appearing in
$(\delta_A^{(jk)} S)$ and the tensor bilinear $\hat T^{(kj)}_{\mu\nu}$, the product of these operators at the same position
will not be improved. To improve this product one needs to add all operators with the same
transformation properties as the product, and having appropriate dimension (here dimension
4 because of the overall factor of $\tilde m_j + \tilde m_k$). In the Ward identity considered above, in which
the bilinear on the RHS was flavor off-diagonal, only a single operator could appear, namely
$\delta O$ with appropriate flavor indices. By contrast, in the present identity, two operators are
allowed, namely those appearing on the RHS,

$$a\tilde m_{jk}\left(\delta T^{(jj)}_{\mu\nu}(y) + \delta T^{(kk)}_{\mu\nu}(y)\right) \quad \text{and} \quad a\tilde m_{jk}\,\mathrm{tr}\,\delta T_{\mu\nu}(y). \qquad (93)$$

The latter arises from Wick contractions in which the quark and antiquark in $P^{(jk)}$ are
both contracted with the corresponding antiquark and quark in $T^{(kj)}$. To understand (93)
in terms of operators vanishing by the equations of motion requires a generalization of the
prescription given in Ref. The appearance of a second operator plays an important role
in the discussion of vector Ward identities in appendix C.



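The remaining Ward identities do not provide any new information on the improvement
coefficients, but do provide several important cross-checks. Consider first

$$\langle(\delta_A^{(jk)} S)\,\hat S^{(kj)}(y)\,J(z)\rangle = \left\langle\left[\hat P^{(jj)}(y) + \hat P^{(kk)}(y)\right]J(z)\right\rangle + O(a^2). \qquad (94)$$

Dividing both sides by

$$Z_S\left[1 + a\,\mathrm{tr}M\left(\bar b_S - \frac{r_m - 1}{3}\,b_S\right)\right] \qquad (95)$$

the LHS is then a known quantity, while the RHS becomes (picking $j = 1$, $k = 2$ and setting
$\tilde m_{12} = 0$)

$$\frac{\hat P^{(11)} + \hat P^{(22)}}{Z_S\left[1 + a\,\mathrm{tr}M\left(\bar b_S - \frac{r_m-1}{3}\,b_S\right)\right]} = \frac{Z_P}{Z_S}\left\{\left(\frac{2r_P}{3} + \mathcal{N}'a\,\mathrm{tr}M\right)\mathrm{tr}P + \left(\frac{1}{3} + \mathcal{P}'a\,\mathrm{tr}M\right)\mathrm{tr}(\bar\lambda_8 P) + \mathcal{Q}'a\,\mathrm{tr}(\lambda_3 M)\,\mathrm{tr}(\lambda_3 P)\right\}, \qquad (96)$$

where the operators on the right-hand side are understood to include their $O(a)$ improvement terms, and

$$\mathcal{N}' = \frac{2}{3}\left[r_P\left(\bar d_P - \bar b_S + \frac{d_P}{3} + \frac{r_m - 1}{3}\,b_S\right) - r_m\left(f_P + \frac{b_P}{3}\right)\right], \qquad (97)$$

$$\mathcal{P}' = \frac{1}{9}\left[3(\bar b_P - \bar b_S) + b_P - b_S + r_m\,(b_P + b_S - 2 r_P d_P)\right], \qquad (98)$$

$$\mathcal{Q}' = \frac{1}{6}\,(b_P + 2 r_P d_P). \qquad (99)$$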
A similar analysis holds for the Ward identity with $S$ and $P$ interchanged, except that one
must subtract the disconnected component from both sides of the equation. Combining
these two Ward identities allows the determination of the coefficients listed in Table II.

In fact, the disconnected component on the LHS of the Ward identity (94) with $S$ and $P$
interchanged gives access to the quark condensate:

$$\langle(\delta_A^{(jk)} S)\,\hat P^{(kj)}(y)\rangle\Big|_{\tilde m_{jk}=0} = \langle \hat S^{(jj)}(y) + \hat S^{(kk)}(y)\rangle + O(a^2). \qquad (100)$$

Here the idea is to calculate the LHS, and use it as the definition of the RHS. We know all
the improvement and renormalization constants appearing on the LHS, and, by working at
$\tilde m_{jk} = 0$, we avoid contact terms. In this way we can calculate the improved condensate,
including its mass dependence. This is the $O(a)$ improved version of the method first
suggested in Ref. 2^ .

The final two Ward identities of this type involve vector and axial currents, e.g.

$$\langle(\delta_A^{(jk)} S)\,\hat A^{(kj)}_\mu(y)\,J(z)\rangle = \left\langle\left[\hat V^{(kk)}_\mu(y) - \hat V^{(jj)}_\mu(y)\right]J(z)\right\rangle + O(a^2). \qquad (101)$$

Now only flavor non-singlet operators appear, so we cannot obtain information about singlet
improvement coefficients. In fact, this identity and that with $V \leftrightarrow A$ only differ from the
previous non-singlet axial Ward identities, eq. (69), when $m_j \neq m_k$. In this case there
are additional quark-disconnected contractions absent in (69). Since we are assuming that
we know the improved off-diagonal axial current, the LHS of this relation is completely
known, allowing a complete determination of the constants appearing on the RHS. It is
a straightforward exercise using the form of the improved diagonal non-singlet bilinears,
eq. (18), to show that the new information that one obtains here over that obtained
previously is a direct determination of $f_V$ and $b_V$. In particular, this Ward identity provides
the only cross-check of the calculation of $f_V$. The identity with $V \leftrightarrow A$ similarly provides
an alternative determination of $f_A$.



V. SUMMARY AND CONCLUSIONS 



Improvement in the presence of non-degenerate dynamical quarks is far more complicated
than that with degenerate quarks or that in the quenched approximation. Nevertheless, the
considerable number of extra improvement coefficients that arise for quark bilinears and
quark masses can almost all be determined by enforcing the vector and axial transformation
properties of these operators. The results from the Ward identities considered above
are collected in Tables I and II. The only scale-independent quantities which are left
undetermined are $\bar d_A$ and $\bar b_T + \bar d_T$. To determine these one must use other methods. The
only two that we are aware of are improved non-perturbative renormalization [10] and
matching to perturbative forms for short distance correlation functions of bilinears. The
scale-dependent quantities $Z_T$, $Z_S Z_P$, and $r_A$ are also undetermined, but this had to be
the case as Ward identities do not involve a renormalization scale. For these one must use
a method like non-perturbative renormalization.

This corrects the conclusion of Ref. [12] that there were three undetermined scale-independent quantities.

It is interesting to understand why the two scale-independent quantities cannot be
determined using Ward identities. In the case of $\bar d_A$, the reason is the lack of an identity
involving $\mathrm{tr}\hat A_\mu$ that has a non-vanishing variation. Thus the overall normalization factor,
which includes $\bar d_A$, cannot be determined. This is clearly related to the fact that $r_A$ cannot
be determined, because it is scale-dependent.

The reason is similar for $\bar b_T + \bar d_T$. Ward identities relate the flavor-singlet tensor to the
flavor non-singlet tensor. Overall factors thus cancel, and the mass-dependent part of the
overall factor is proportional to $\bar b_T + \bar d_T$. Thus, in essence, this combination cannot be
determined because $Z_T$ cannot.

To test these arguments, and for completeness, we have extended the analysis to two and
four non-degenerate flavors. These cases are also of phenomenological interest. A summary
of the results is given in appendices A and B. With four flavors, one might naively have
hoped to determine more coefficients, since the three flavor theory is included as a subset. We
find, however, that although most Ward identities by themselves allow more combinations
of coefficients to be determined, so that the analysis is cleaner, the final result is the same
as with three flavors. In particular, $\bar d_A$ and $\bar b_T + \bar d_T$ cannot be determined, for exactly the
same reasons as for three flavors.

With two flavors, there are, on the one hand, fewer coefficients to determine, but, on the
other hand, fewer Ward identities. Furthermore, each identity determines fewer coefficients
because there are fewer masses to vary independently. The net result is that there are more
undetermined combinations of scale-independent quantities than for three or four flavors
(eight in all). Particularly striking is the result that one cannot determine the improved
flavor non-singlet axial current, pseudoscalar density or tensor bilinear away from the chiral
limit.

Given the complexity of the calculations we outline, one might wonder about possible
simplifications. Since adding flavors extends ($N_f = 2 \to 3$) or simplifies ($N_f = 3 \to 4$) the
analysis, one might consider using a partially quenched simulation with, say, 2 + 1 flavors
of sea quarks, and four or more flavors of valence quarks. To add additional information
over the unquenched analysis, one must necessarily consider theories with differing sea and 
valence content, which are therefore not unitary. This makes the basis of the Symanzik 
improvement program less secure. Nevertheless, it is certainly possible, assuming the im- 
provement program remains valid, to use the enlarged graded symmetry groups of partially 
quenched theories to constrain the allowed improvement coefficients, and to generalize the 
analysis presented here. 

Staying within the unquenched three flavor theory, one can ask whether it is sufficient to 
use 2 + 1 flavor theories in which at least two quarks are degenerate. This point has been 
discussed for each identity considered in the text, and we find that 2 + 1 flavors is sufficient 
to determine all the allowed coefficients. 

Another practical question is whether one needs to determine all the improvement
coefficients for phenomenologically interesting applications. Particularly interesting are (1)
matrix elements of the electromagnetic current, (2) matrix elements of flavor off-diagonal
vector and axial currents (for weak transitions), (3) quark masses and (4) matrix elements of
the mass term in the action, $m_j S^{(jj)}$. We consider these cases in turn. (1) The improvement
and renormalization coefficients for the electromagnetic current, which is a flavor non-singlet
for $N_f = 3$, are $Z_V$, $b_V$, $\bar b_V$ and $f_V$, and these can all be determined by normalizing the
matrix elements of the charge. The determination does, however, require quark-disconnected
matrix elements. This also determines the improved flavor off-diagonal vector current. (2)
The off-diagonal axial current requires $Z_A$, $b_A$ and $\bar b_A$, which can all be determined from
flavor off-diagonal three-point Ward identities, as long as we know $b_V$. Thus only
quark-connected contractions are needed. (3) Improvement of individual quark masses requires
$Z_m$, $r_m$, $b_m$, $\bar b_m$, $d_m$ and $\bar d_m$, i.e. both flavor singlet and non-singlet coefficients.
To determine these requires all the types of Ward identity we consider, i.e. two- and
three-point Ward identities involving both flavor singlet and non-singlet operators. The same is
true for (4), the combination $m_j S^{(jj)}$. Thus we see that for some applications there are
considerable simplifications, but for others there are not. We stress, however, that for all of
these quantities one must first determine $b_g$, although, as discussed in the introduction, a
perturbative determination of this numerically small coefficient may be sufficient.

Finally, we note that one might consider our work as an advertisement for other
approaches to improvement with Wilson-like fermions, namely "Wilson averaging" [3] and
twisted mass QCD [27] at maximal twist 2^. In both approaches the $O(a)$ terms are
automatically absent in the physical matrix elements of the operators we consider here, so that
no improvement of the operators themselves is necessary. For this to hold, however, one
needs an even number of fermions, and so for the physical case one must simulate with four
flavors. First work in this direction has begun [28].
APPENDIX A: FOUR NON-DEGENERATE FLAVORS

In this section we briefly describe the generalization of our calculations to $N_f = 4$. Having
one additional flavor does not change the number of improvement coefficients, but does
increase the number of independent masses that one can vary. This allows the extraction
of additional improvement coefficients in many of the Ward identities. The final result,
however, is the same as for $N_f = 3$: all scale-independent coefficients can be determined
except for $\bar d_A$ and $\bar b_T + \bar d_T$.

We summarize the results in Tables I and III and in the following comment on the ways
in which the calculation differs from that with $N_f = 3$.

1. The general forms for the improvement of bilinears and masses, eqs. (15), (23), (24) and (25),
are unchanged, except that $\lambda$ are now SU(4) generators. This is because no special
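TABLE III: Normalization and improvement coefficients determined using various Ward identities
(which are denoted schematically) for $N_f = 4$. Notation as in Table II.

| Ward identity | LO | NLO |
|---|---|---|
| $\langle H\vert \sum \hat V_4^{(jj)} \vert H\rangle = Q_H^{(j)}$ | $Z_V$, $r_V$ | $b_V$, $\bar b_V$, $f_V$, $d_V$, $\bar d_V$ |
| $\partial_\mu A_\mu^{(jk)} = (m_j + m_k) P^{(jk)}$ | $Z_m Z_P/Z_A$, $r_m$, [$c_{SW}$, $c_A$] | $b_P - b_A$, $\bar b_P - \bar b_A + \bar d_m$ |
| $\partial_\mu[A_\mu^{(jj)} - A_\mu^{(kk)}] = 2m_j P^{(jj)} - 2m_k P^{(kk)}$ | $Z_m Z_P/Z_A$, $r_P$, $g_P$, [$c_{SW}$, $c_A$] | $b_A$, $f_A$, $b_P$, $f_P$, $d_P$, $b_m$, $d_m$, $\bar b_P - \bar d_P$, $\bar b_A - \bar d_P - \bar d_m$, $\bar b_m - \bar d_m$ |
| $\delta_A^{(ij)} V^{(jk)} = A^{(ik)}$ | $Z_V$ | $\bar b_V$, $b_V$ |
| and $V \leftrightarrow A$ | $Z_A^2$, [$c_V$] | $\bar b_A$, $b_A$ |
| $\delta_A^{(ij)} T^{(jk)} = T^{(ik)}$ | $Z_A$, [$c_T$] | $b_T$, $4\bar b_A - b_A(r_m - 1)$ |
| $\delta_A^{(ij)} P^{(jk)} = S^{(ik)}$ | $Z_S/Z_P$ | $b_S$, $b_P$, $\bar b_P - \bar b_S$ |
| and $P \leftrightarrow S$ | | $4\bar b_A - b_A(r_m - 1)$ |
| $\delta_A^{(jk)} \mathrm{tr}A = \delta_A^{(jk)} \mathrm{tr}V = 0$ | $\bar c_A$, $\bar c_V$ | $d_A$, $d_V$ |
| $\delta_A^{(jk)} \mathrm{tr}P = 2S^{(jk)}$ | $r_P Z_P/Z_S$, $g_P$ | $d_P$, $4(\bar b_S - \bar d_P) - b_S(r_m - 1)$ |
| $\delta_A^{(jk)} \mathrm{tr}S = 2P^{(jk)}$ | $r_S Z_S/Z_P$, $g_S$ | $d_S$, $4(\bar b_P - \bar d_S) - b_P(r_m - 1)$ |
| $\delta_A^{(jk)} T^{(kj)} = T^{(jj)} + T^{(kk)}$ | $r_T$, $\bar c_T$, [$c_T$] | $b_T$, $d_T$, $r_T(\bar d_T - \bar b_T) - r_m f_T$ |
| $\delta_A^{(jk)} S^{(kj)} = P^{(jj)} + P^{(kk)}$ | $Z_P/Z_S$, $r_P$, $g_P$ | $b_P$, $b_S$, $d_P$, $d_S$, $\bar b_P - \bar b_S$ |
| and $S \leftrightarrow P$ | $r_S$, $g_S$ | $r_P(\bar d_P - \bar b_P) - r_m f_P$, $r_S(\bar d_S - \bar b_S) - r_m f_S$ |
| $\delta_A^{(jk)} A^{(kj)} = V^{(kk)} - V^{(jj)}$ | Not new | $b_V$, $\bar b_V$, $f_V$ |
| and $A \leftrightarrow V$ | Not new | $b_A$, $\bar b_A$, $f_A$ |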

properties of SU(3) have been used in writing these equations. Thus the number of
improvement coefficients is unchanged from $N_f = 3$.

2. The use of the vector Ward identity, eq. (27), is also unchanged.

3. The enforcement of "$Z_S Z_m = 1$", i.e. eq. (33), is similar to the three flavor case,
although more complicated algebraically as it involves determinants of 4 x 4 matrices.
The results are given in Table I; some are identical to those with $N_f = 3$, while others
have $N_f$ dependence. There remains a single constraint, due to the absence of an
"$f$-term" in the improvement of the quark masses.

4. The two-point axial Ward identities are somewhat more powerful than for $N_f = 3$, due
to the greater number of independent combinations of masses that can be constructed,
particularly at quadratic order. As can be seen from Table III, one can determine ten
combinations of improvement coefficients, all of which are fairly simple, compared to
nine when $N_f = 3$, some of which are complicated (see Table II). For four flavors the
flavor off-diagonal identity, eq. (47), becomes redundant, giving no information not
contained in the flavor diagonal identity (54).

5. Three-point Ward identities involving only off-diagonal bilinears, eq. (69), are also
more powerful, because there is one additional combination of masses that is independent
from that multiplying the contact term. This allows the separate determination
of the $b_O$ from these identities alone, and of $\bar b_A$, $\bar b_V$, and $\bar b_S - \bar b_P$. On the other hand, the
net result, including information from previous identities, is the same as for $N_f = 3$:
the newly determined constants remain $Z_A$, $Z_S/Z_P$, $\bar b_A$ and $b_P$.

6. The Ward identities involving axial transformation of flavor singlets, e.g. eq. (76),
give essentially the same information as for $N_f = 3$. The extra combination of masses
that is available does not multiply new combinations of coefficients. These identities
continue to provide the only determination of $d_A$, $\bar c_A$ and $\bar c_V$, as well as the first
determination of $d_S$. The latter, together with the constraint $d_S = b_S + N_f \bar b_S$, allows
the separate determination of all the mass and pseudoscalar improvement coefficients,
as for $N_f = 3$.

7. The final type of identity, exemplified by eq. (87), is more powerful with $N_f = 4$, as
can be seen from the tables. Nevertheless, the only new information obtained is $f_T$.

In summary, with four flavors one has considerably more cross-checks, and the extraction
of individual improvement coefficients is more straightforward, but in the end one obtains
the same set of coefficients as for three flavors.
APPENDIX B: TWO NON-DEGENERATE FLAVORS

In this appendix we describe how our considerations change when $N_f = 2$. The first new
feature is that there is one fewer independent improvement coefficient for each bilinear and
for the quark masses. This follows from the fact that $d_{abc} \propto \mathrm{tr}(\sigma_a\{\sigma_b, \sigma_c\}) = 0$ in SU(2)
(with $\sigma_a$, $a = 1, 2, 3$ the Pauli matrices). Thus one has

$$\{\sigma_a, M\} = \sigma_a\,\mathrm{tr}M + \mathrm{tr}(\sigma_a M)\,\mathbf{1} \;\Rightarrow\; (1)\ \ \mathrm{tr}(\{\sigma_a, M\}O) = \mathrm{tr}M\,\mathrm{tr}(\sigma_a O) + \mathrm{tr}(\sigma_a M)\,\mathrm{tr}O,$$

$$(2)\ \ \mathrm{tr}(\sigma_a M^2) = \mathrm{tr}(\sigma_a M)\,\mathrm{tr}M. \qquad (B1)$$

This means that the $b_O$ term in the improvement of the bilinears and the $b_m$ term in eq. (24) are not independent
and can be absorbed into the other terms by changing their coefficients as follows:

$$\bar b_O \to \bar b_O' = \bar b_O + b_O/2, \qquad f_O \to f_O' = f_O + b_O/2, \qquad \bar b_m \to \bar b_m' = \bar b_m + b_m. \qquad (B2)$$

In the following, we assume that this has been done, and that the primes are then dropped.
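
Both relations in eq. (B1) are easy to verify directly. The sketch below is an illustration in plain Python and not part of the original derivation; the helper names are ours, and the test matrices are arbitrary. It checks the anticommutator identity and the relation for $\mathrm{tr}(\sigma_a M^2)$ for all three Pauli matrices.

```python
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
IDENTITY = [[1, 0], [0, 1]]


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def add(a, b):
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]


def scale(c, a):
    return [[c * a[i][j] for j in range(2)] for i in range(2)]


def trace(a):
    return a[0][0] + a[1][1]


def check_b1(mass_matrix):
    """Return True if both relations in eq. (B1) hold for the given 2x2 matrix."""
    tr_m = trace(mass_matrix)
    m_squared = matmul(mass_matrix, mass_matrix)
    for sigma in SIGMA:
        anticommutator = add(matmul(sigma, mass_matrix), matmul(mass_matrix, sigma))
        tr_sigma_m = trace(matmul(sigma, mass_matrix))
        expected = add(scale(tr_m, sigma), scale(tr_sigma_m, IDENTITY))
        if anticommutator != expected:
            return False
        if trace(matmul(sigma, m_squared)) != tr_sigma_m * tr_m:
            return False
    return True
```

Integer entries keep the arithmetic exact; the identities hold for any 2 x 2 matrix, not only for diagonal mass matrices.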

Although there are thus six fewer coefficients to determine, it turns out that there are fewer
Ward identities available, and that each is less powerful than for $N_f = 3$. The results are
collected in Tables I and IV and we discuss the salient features in the following.

1. The use of the vector Ward identity, eq. (27), is unchanged, aside from the fact that
there is one fewer coefficient to determine.

2. The enforcement of "$Z_S Z_m = 1$", i.e. eq. (33), follows the same steps as above, but
leads to simpler relations because of the absence of $b_m$ and $b_S$. The results in Table I
remain valid for $N_f = 2$ as long as one sets $b_m = b_S = 0$.

3. The two-point axial Ward identities are less powerful than for $N_f = 3$, due to the
smaller number of independent combinations of masses. Combining the flavor diagonal
and off-diagonal identities, one only determines $r_m Z_m Z_P/Z_A$, $r_P/r_m$ and $g_P$ at LO,
and $f_A$, $d_m$, $d_P$, and the combinations $\bar d_m + \bar b_P - \bar b_A$ and $r_P(\bar b_m - \bar b_A + \bar d_P) + r_m f_P$ at
NLO. Note that one cannot determine $r_m$ and $r_P$ separately, nor obtain $\bar b_A$, unlike for
$N_f = 3$.

Alternatively, one can keep the relations as they stand in the Table and then absorb $b_m$ and $b_S$ into the
other coefficients as in eq. (B2).
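TABLE IV: Normalization and improvement coefficients determined using various Ward identities
(which are denoted schematically) for $N_f = 2$. Notation as in Table II.

| Ward identity | LO | NLO |
|---|---|---|
| $\langle H\vert \sum \hat V_4^{(jj)} \vert H\rangle = Q_H^{(j)}$ | | $f_V$, $d_V$, $\bar d_V$ |
| $\partial_\mu A_\mu^{(12)} = (m_1 + m_2) P^{(12)}$ | $r_m Z_m Z_P/Z_A$, [$c_{SW}$, $c_A$] | $d_m$, $\bar b_A - \bar b_P - \bar d_m$ |
| $\partial_\mu[A_\mu^{(11)} - A_\mu^{(22)}] = 2m_1 P^{(11)} - 2m_2 P^{(22)}$ | $r_m Z_m Z_P/Z_A$, $r_m/r_P$, $g_P$, [$c_{SW}$, $c_A$] | $f_A$, $r_P d_P + r_m d_m$, $r_m(d_m + 2\bar d_m + 2\bar b_P - 2\bar b_A)$, $r_P(2\bar b_m + 2\bar d_P - 2\bar b_A + d_P) + 2r_m f_P$ |
| $\delta_A^{(12)} \mathrm{tr}A = \delta_A^{(12)} \mathrm{tr}V = 0$ | $\bar c_A$, $\bar c_V$ | $d_A$, $d_V$ |
| $\delta_A^{(12)} \mathrm{tr}P = 2S^{(12)}$ | $r_P Z_P Z_A/Z_S$, $g_P$ | $d_P$ |
| $\delta_A^{(12)} \mathrm{tr}T \sim T^{(12)}$ | $r_T Z_A$, $\bar c_T$, $c_T$ | $d_T$ |
| $\delta_A^{(12)} \mathrm{tr}S = 2P^{(12)}$ | $r_S Z_S Z_A/Z_P$, $g_S$ | $d_S$ |
| $\delta_A^{(12)} T^{(21)} \sim T^{(11)} + T^{(22)}$ | $Z_A/r_T$, $\bar c_T$, $c_T$ | $d_T$ |
| $\delta_A^{(12)} S^{(21)} = P^{(11)} + P^{(22)}$ | $r_P Z_P/(Z_A Z_S)$, $g_P$ | $d_P$ |
| $\delta_A^{(12)} P^{(21)} = S^{(11)} + S^{(22)}$ | $r_S Z_S/(Z_A Z_P)$, $g_S$ | $d_S$ |
| $\delta_A^{(12)} A^{(21)} = V^{(22)} - V^{(11)}$ | $Z_V/Z_A^2$, $c_V$, [$c_A$] | $f_V$ |
| $\delta_A^{(12)} V^{(21)} = A^{(22)} - A^{(11)}$ | $Z_V$, $c_V$, [$c_A$] | $f_A$ |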

4. For two flavors there are no three-point Ward identities involving only off-diagonal
bilinears. Thus one loses what has been one of the central tools in quenched studies.
In particular, for $N_f = 3$ and 4 these are the identities that are used to determine $c_V$
and $c_T$. Here we need to use other identities for this purpose, as discussed below and
indicated in the Table.

5. The Ward identities involving axial transformations of the singlet axial and vector
currents, e.g. eq. (76), give the same information as for $N_f = 3$. However, those
involving the tensor, scalar and pseudoscalar bilinears give less information. This
is because there is one fewer combination of quark masses that is independent of the
contact term. In fact the analysis is more straightforward, because the contact term
is proportional to $\tilde m_{12} \propto \mathrm{tr}(M) + O(a)$, and so one does not need to know $r_m$ in order
to work at $\tilde m_{12} = 0$.

At this stage we still have not determined $Z_A$ or $\bar b_A$, so we do not know the
normalization of the axial variation of the action. Note also that the tensor Ward identity
allows the first determination of $c_T$.

6. The final type of identity is that exemplified by eq. (87). This again is simpler to
analyze for $N_f = 2$, since the RHS is a pure flavor singlet (for $T$, $S$ and $P$), rather
than a mix of singlet and non-singlet. In particular, new information is obtained in
the chiral limit, and allows one, combined with previous results, to disentangle $Z_A$ and
$r_T$.

The identities involving the vector bilinears are the first to allow a determination of $c_V$.
In fact, in the chiral limit, where one works to determine $c_V$, these identities involve
the same quark contractions as the three-point Ward identities with only off-diagonal
bilinears that are present for $N_f \geq 3$. Thus, from a computational point of view, there
is no difference in the method to be used to determine $c_V$ for $N_f = 2$. This is not,
however, the case for $c_T$.

The following scale-independent constants remain undetermined by the Ward identities:
$\bar b_A$ and $\bar d_A$; $\bar b_T$, $\bar d_T$ and $f_T$; $Z_S/Z_P$, $r_P$, $\bar b_P$, $\bar d_P$ and $f_P$; and $r_S$ and $\bar d_S$ or equivalently $r_m$ and
$\bar d_m$, although the following combinations of these coefficients are known:

$$\bar b_A - \bar b_P - \bar d_m, \qquad r_P(\bar b_A - \bar b_m - \bar d_P) - r_m f_P, \qquad r_P Z_P/Z_S, \qquad r_P r_S. \qquad (B3)$$

Thus in total there are eight undetermined combinations of scale-independent coefficients.
This is six more than for $N_f = 3$, despite the need to determine six fewer coefficients.

What is perhaps most striking about this list is that, even if one uses non-degenerate
quarks when implementing Ward identities, one cannot determine all the coefficients needed
for flavor non-singlet bilinears composed of degenerate quarks. In particular, since one
cannot separately determine $\bar b_A$, $\bar b_P$ or $\bar b_T$, one does not know the overall normalization of
the corresponding bilinears away from the chiral limit. To determine this one must use a
method such as those proposed in Ref.
APPENDIX C: VECTOR WARD IDENTITIES

In this appendix we explain the claim made in the text that, other than the normalization
of charges, enforcing vector Ward identities leads to no information on improvement
coefficients. It is interesting to see how this works in detail, and how we concluded otherwise
in Ref. [12]. It also gives a good example of how the mnemonic introduced in Ref. 0,
of using equations-of-motion operators to determine the form of contact terms, needs to be
modified.

We begin by recalling some results from the main text. Enforcing the charge of hadrons,
eq. (27), determines the improved, renormalized diagonal vector currents, $\hat V^{(jj)}_\mu$, aside from
the $c_V$ and $\bar c_V$ terms. The latter terms do not contribute to the divergence, so $\partial_\mu \hat V^{(jj)}_\mu$ is
fully improved. This implies that the divergences of the off-diagonal currents, $\partial_\mu \hat V^{(jk)}_\mu$, $j \neq k$,
are also improved, because these involve the same improvement coefficients as the diagonal
currents (in particular, $Z_V$, $r_V$, $b_V$ and $\bar b_V$; the coefficient $f_V$ is known but not required).
These results hold whatever the precise form of the underlying lattice current.

Next we recall the form of the exact lattice vector Ward identities. Making the change
of variables $\delta_V \psi_j = \psi_k$ and $\delta_V \bar\psi_k = -\bar\psi_j$ over a region of the lattice $\mathcal{V}$, one finds

$$\Big\langle \sum_{x \in \mathcal{V}} \left[(m_j - m_k)\,S^{(jk)}(x) - \partial^*_\mu V^{\mathrm{lat},(jk)}_\mu(x)\right] O(y)\,J(z) \Big\rangle = \langle [\delta_V^{(jk)} O](y)\,J(z) \rangle. \qquad (C1)$$

Here $y$ is in the region $\mathcal{V}$, while $z$ is not, and $\delta_V^{(jk)} O$ is the variation of the operator $O$ under the
vector transformation. Note that the bare lattice quark masses appear in (C1) irrespective
of the presence of the $c_{SW}$ term, and that $S^{(jk)}$ is the local scalar bilinear. The current
$V^{\mathrm{lat},(jk)}_\mu$ is the usual lattice vector current, also unaffected by the $c_{SW}$ term. It is associated
with a link, and not a site, but its divergence is associated with a site:

$$V^{\mathrm{lat},(jk)}_\mu(x + \hat\mu/2) = \frac{1}{2}\left[\bar\psi_j(x)(\gamma_\mu - 1)U_{x,\mu}\psi_k(x + \hat\mu) + \bar\psi_j(x + \hat\mu)(\gamma_\mu + 1)U^\dagger_{x,\mu}\psi_k(x)\right],$$

$$\partial^*_\mu V^{\mathrm{lat},(jk)}_\mu(x) = \sum_\mu \frac{V^{\mathrm{lat},(jk)}_\mu(x + \hat\mu/2) - V^{\mathrm{lat},(jk)}_\mu(x - \hat\mu/2)}{a}. \qquad (C2)$$

The results flCl|) and ()C2|) hold both for j ^ k and j = k. In particular, the diagonal 
lattice current is conserved, and it generates canonically normalized vector transformations. 
This implies that the charge constructed from the lattice current is correctly normalized 
[as can be derived from eq. In other words, for this current, Zy^ = 1 = r^*, and 
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the mass dependent improvement coefficients all vanish. As is well known, however, the 
lattice current is not improved. For this one must add cy- and cy-hke terms, as in eq. ©, 
except that they must be associated with a link, rather than a site. It is possible to do so 
in such a way that taking the divergence of the current [in the form that appears in the 
Ward identity ()C1|) ] exactly cancels the cy- and c^-like terms. Because of this the cy- and 
cy-like terms neither contribute to the vector Ward identities nor affect the values of the 
other improvement and normalization constants. 

In summary, the divergences of the flavor-diagonal conserved lattice vector currents are
automatically improved. As noted above, this also implies that the divergences of the flavor
off-diagonal lattice vector currents are improved. Thus the $\partial_\mu V_\mu^{{\rm lat},(jk)}$ term in the vector
variation of the action in the lattice Ward identity (C1) is improved. This means that
it can be replaced, up to errors of $O(a^2)$, with $\partial_\mu \hat V_\mu^{(jk)}$, the divergence of the improved
local vector current. In this replacement we do not have to worry about contact terms, as
$\sum_V \partial_\mu \hat V_\mu^{(jk)}$ vanishes except at the surface of the region $V$, and by assumption there are no
other operators there.
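
The surface-term statement can be illustrated with a toy calculation. The sketch below is a one-dimensional analogue, not the paper's four-dimensional setup: it assumes a current defined on links, takes the site divergence as the difference of the two adjacent links divided by $a$ (the structure of eq. (C2)), and checks that the sum over a region collapses to the two links crossing its boundary. The function names and the random link values are hypothetical and purely illustrative.

```python
import random


def lattice_divergence(link_current, site, a=1.0):
    """Site divergence of a link-based current (1D toy analogue of eq. (C2)).

    link_current[n] is the current on the link joining sites n and n+1,
    i.e. at position n + 1/2 in lattice units.
    """
    return (link_current[site] - link_current[site - 1]) / a


def summed_divergence(link_current, first_site, last_site, a=1.0):
    """Sum the site divergence over the region V = [first_site, last_site]."""
    return sum(
        lattice_divergence(link_current, s, a)
        for s in range(first_site, last_site + 1)
    )


random.seed(0)  # arbitrary seed; the values are illustrative, not simulation data
links = [random.randint(-5, 5) for _ in range(16)]

a = 1.0
total = summed_divergence(links, 4, 9, a)
# Interior links cancel pairwise; only the two boundary links survive.
surface = (links[9] - links[3]) / a
print(total, surface)
```

Because the interior contributions cancel pairwise, any operator sitting strictly inside or outside the region never meets the summed divergence, which is the reason no contact terms arise in the replacement.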

The other term in the vector variation of the lattice action in (C1) involves bare masses
and scalar densities. As shown in the main text, this maintains the same form when written in
terms of improved masses and scalar densities.

Combining these observations we can rewrite the lattice vector Ward identity in terms of 
the improved bilinears and masses considered in the main text: 

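$$\left\langle \sum_{x \in V} \left[ (\hat m_j - \hat m_k)\, \hat S^{(jk)}(x) - \partial_\mu \hat V_\mu^{(jk)}(x) \right] O(y)\, J(z) \right\rangle = \left\langle \left[ \delta_V^{(jk)} O \right](y)\, J(z) \right\rangle + O(a^2) . \tag{C3}$$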

This relation leads, amongst other things, to the normalization condition for the vector
charge given in the main text. Note that this relation is "off-shell" improved, since there are no contact
terms of $O(a)$. These could only enter with the scalar density term, but the relation between bare and
improved mass terms used above is an algebraic identity, so using it does not introduce additional terms. The improved bilinear
$\hat S^{(jk)}$ is, however, only on-shell improved, and this will be crucial in the following.

The result eq. (C3) shows that, if one uses on-shell improved local bilinears and masses
in the discretized form of the vector variation of the action, then the operators in the associated
Ward identities automatically transform with the correct normalization. This is the concrete
form of the statement made in the text that the vector Ward identities are automatically
satisfied. There are some subtleties, however, in the application of this result, and we spend
the remainder of this appendix describing some examples.

Footnote: The form of the $c_V$ and $\bar c_V$ terms for $V_\mu^{\rm lat}(x + \hat\mu/2)$ is $\partial_\nu [T_{\mu\nu}(x + \hat\mu) + T_{\mu\nu}(x)]/2$, where the derivative is
the symmetric difference.



1. Two-point vector Ward identities 

In quenched studies of improvement, the two-point vector Ward identity 

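$$\left\langle \left[ (\hat m_j - \hat m_k)\, \hat S^{(jk)}(x) - \partial_\mu \hat V_\mu^{(jk)}(x) \right] J^{(kj)}(0) \right\rangle = O(a^2) , \qquad j \neq k ,\ x \neq 0 , \tag{C4}$$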



has been used as part of the method employed to determine improvement coefficients.
The result eq. (C3) shows, however, that this identity is automatically satisfied, as long as one
uses the correctly normalized vector current. Thus it serves only to check the normalization 
of the vector current, and provides no information on the improvement coefficients of quark 
masses and scalar bilinears. This point was not appreciated in Ref. 0. 



2. A misleading three-point vector Ward identity 



In Ref. [12], we argued that we could determine $g_S$ and $g_P$ by enforcing the vector
transformation properties of the flavor-singlet scalar and pseudoscalar bilinears, respectively. 
This is incorrect, as we now show. It is simplest to do this with the pseudoscalar density as 
it has no mixing with the identity operator. 

The identity in question is 



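$$\left\langle \sum_{x \in V} \left[ \partial_\mu \hat V_\mu^{(jk)}(x) \right] {\rm tr}\hat P(y)\, J^{(kj)}(z) \right\rangle = O(a^2) , \tag{C5}$$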



where we work in the chiral limit so that there are no contact terms. We recall that, in the 
chiral limit, the improved, normalized pseudoscalar density is 



$${\rm tr}\hat P = Z_P\, r_P \left[ {\rm tr} P + a\, g_P\, {\rm Tr}\!\left( F_{\mu\nu} \tilde F_{\mu\nu} \right) \right] . \tag{C6}$$



Our previous argument was that, in order for the LHS of (C5) to be of $O(a^2)$, $O(a)$ contributions
from the two terms in ${\rm tr}\hat P$ must cancel against each other, thus determining $g_P$. In
fact, it follows from eq. (C3) that both terms separately are automatically of $O(a^2)$, e.g.



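$$\left\langle \sum_{x \in V} \left[ \partial_\mu \hat V_\mu^{(jk)}(x) \right] {\rm Tr}\!\left( F_{\mu\nu} \tilde F_{\mu\nu} \right)\!(y)\, J^{(kj)}(z) \right\rangle = O(a^2) , \tag{C7}$$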



so that the identity (C5) is satisfied for any value of $g_P$. In other words, the two terms in
${\rm tr}\hat P$ are separately invariant under vector transformations up to $O(a^2)$.
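
The flavor structure behind this invariance can be checked with a few lines of matrix algebra. The sketch below assumes a particular convention that is not spelled out in this appendix: a bilinear is written as $\bar\psi \Lambda \Gamma \psi$ with flavor matrix $\Lambda$, and the vector transformation $\delta\psi = T\psi$, $\delta\bar\psi = -\bar\psi T$ changes it by the commutator $[\Lambda, T]$. Under that assumption a flavor-singlet operator ($\Lambda = 1$) has vanishing variation, while an off-diagonal bilinear rotates into a difference of diagonal ones. All helper names are hypothetical.

```python
def unit(n, row, col):
    """n x n matrix with a single 1 at (row, col)."""
    return [[1 if (r, c) == (row, col) else 0 for c in range(n)] for r in range(n)]


def identity(n):
    return [[1 if r == c else 0 for c in range(n)] for r in range(n)]


def matmul(A, B):
    n = len(A)
    return [[sum(A[r][i] * B[i][c] for i in range(n)) for c in range(n)] for r in range(n)]


def subtract(A, B):
    return [[x - y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(A, B)]


def commutator(A, B):
    return subtract(matmul(A, B), matmul(B, A))


N_F = 3        # number of flavors in this toy check
j, k = 0, 1    # two distinct flavor indices

T = unit(N_F, j, k)          # assumed generator of the (jk) vector transformation
lambda_kj = unit(N_F, k, j)  # flavor matrix of the off-diagonal bilinear O^(kj)

singlet_variation = commutator(identity(N_F), T)  # variation of a flavor singlet
offdiag_variation = commutator(lambda_kj, T)      # variation of O^(kj)

print(singlet_variation)
print(offdiag_variation)
```

With this convention the singlet variation is the zero matrix and the off-diagonal variation is the flavor matrix of $O^{(kk)} - O^{(jj)}$; the opposite sign convention for $T$ would simply flip the overall sign.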



3. A paradox and its resolution 



Another vector Ward identity used in Ref. [12] was



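$$\begin{aligned}
\left\langle \sum_{x \in V} \left[ (\hat m_j - \hat m_k)\, \hat S^{(jk)}(x) - \partial_\mu \hat V_\mu^{(jk)}(x) \right] \hat O^{(kj)}(y)\, J(z) \right\rangle = {} & \left\langle \left[ \hat O^{(kk)}(y) - \hat O^{(jj)}(y) \right] J(z) \right\rangle \\
& + \text{contact terms} + O(a^2) .
\end{aligned} \tag{C8}$$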



Here we work at non-zero quark mass, so there are contact terms of $O(a)$ because the scalar
density is not off-shell improved. These are proportional to $(m_j - m_k)$ (without the additional factors
that appear in the axial case), because $\hat m_j - \hat m_k \propto m_j - m_k + O(a)$. If we use the
main-text expressions for the improved bilinears, and divide through by



$$Z_O \left[ 1 + a\, \bar b_O\, {\rm tr} M + a\, b_O\, m_{kj} \right] , \tag{C9}$$



the Ward identity we are enforcing becomes 



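$$\begin{aligned}
& \left\langle \sum_{x \in V} \left[ (\hat m_j - \hat m_k)\, \hat S^{(jk)}(x) - \partial_\mu \hat V_\mu^{(jk)}(x) \right] O^{(kj)}(y)\, J(z) \right\rangle \\
& \quad = \left\langle \left\{ \left[ O^{(kk)}(y) - O^{(jj)}(y) \right] + a\, b_O\, \frac{m_k - m_j}{2} \left[ O^{(jj)}(y) + O^{(kk)}(y) \right] + a\, f_O\, (m_k - m_j)\, {\rm tr} O(y) \right\} J(z) \right\rangle \\
& \qquad + \text{contact terms} + O(a^2) .
\end{aligned} \tag{C10}$$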
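
The origin of the $b_O$ term after dividing by the factor (C9) can be checked with exact arithmetic. The sketch below rests on an assumption not stated in this appendix: that the diagonal bilinear of flavor $q$ carries the normalization $Z_O [1 + a \bar b_O {\rm tr} M + a b_O m_q]$, while the off-diagonal one carries the factor (C9) with $m_{kj} = (m_j + m_k)/2$. All numerical values are hypothetical and serve only to exercise the algebra at first order in $a$.

```python
from fractions import Fraction as F


def ratio_first_order(num_a, den_a):
    """O(a) coefficient of (1 + a*num_a) / (1 + a*den_a), dropping O(a^2)."""
    return num_a - den_a


# Hypothetical bare masses and improvement coefficients (illustration only).
m_j, m_k, tr_M = F(1, 10), F(3, 10), F(1, 2)
b_O, bbar_O, f_O = F(2), F(5), F(7)

m_kj = (m_j + m_k) / 2
den = bbar_O * tr_M + b_O * m_kj  # O(a) part of the factor (C9)

# Assumed diagonal normalization: 1 + a*bbar_O*trM + a*b_O*m_q for flavor q.
coef_kk = ratio_first_order(bbar_O * tr_M + b_O * m_k, den)  # correction to O^(kk)
coef_jj = ratio_first_order(bbar_O * tr_M + b_O * m_j, den)  # correction to O^(jj)

# O^(kk) - O^(jj) picks up a*(coef_kk*O^(kk) - coef_jj*O^(jj)); both pieces
# should equal b_O*(m_k - m_j)/2, i.e. a term multiplying O^(jj) + O^(kk).
expected = b_O * (m_k - m_j) / 2

# The f_O term is already O(a), so the division leaves it unchanged at this order.
f_term = f_O * (m_k - m_j)

print(coef_kk, -coef_jj, expected, f_term)
```

The $\bar b_O\, {\rm tr} M$ pieces cancel in the ratio, which is why only the mass difference survives in front of $O^{(jj)} + O^{(kk)}$.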



Previously, we argued that the form of the contact term could be determined by off-shell
improving $\hat S^{(jk)}$ with the addition of a term proportional to the equations-of-motion operator,
as in earlier work. This leads to a contact term proportional to

$$\text{contact term} \propto \left\langle a\, (m_k - m_j) \left[ O^{(jj)}(y) + O^{(kk)}(y) \right] J(z) \right\rangle , \tag{C11}$$



i.e. of the same form as the $b_O$ term. Thus we concluded that $b_O$ could not be determined
from this Ward identity, but that $f_O$ could, since it multiplies an independent operator.

To see that this is incorrect, we use the general result (C3), from which it follows that

$$\left\langle \sum_{x \in V} \left[ (\hat m_j - \hat m_k)\, \hat S^{(jk)}(x) - \partial_\mu \hat V_\mu^{(jk)}(x) \right] O^{(kj)}(y)\, J(z) \right\rangle = \left\langle \left[ O^{(kk)}(y) - O^{(jj)}(y) \right] J(z) \right\rangle + O(a^2) . \tag{C12}$$

Note that the improvement terms (those proportional to $c_O$) rotate just as the bare operators,
so the improved operator rotates as a whole. Comparing to eq. (C10) we see that the first term on the RHS is
obtained automatically, while the $b_O$ and $f_O$ terms must either cancel against contact terms
or vanish. The discussion above implies that the $b_O$ term is canceled, but we must have
$f_O = 0$. This would be a paradoxical conclusion because we used the vector symmetry in
the first place to conclude that the $f_O$ terms are needed, and yet we have now used the same
symmetry to conclude that they vanish.

In fact, this result is wrong. The flaw in the argument is that there is an additional
contact term in eq. (C10) proportional to

$$\left\langle a\, (m_k - m_j)\, {\rm tr} O(y)\, J(z) \right\rangle , \tag{C13}$$

and this is of the right form to cancel with the $f_O$ term. Thus $f_O$ does not need to vanish, and
indeed cannot be determined from the vector Ward identity, just like $b_O$. The operator in
the new contact term, ${\rm tr} O$, is allowed because it appears in the operator product of $\hat S^{(jk)}$ and
$O^{(kj)}$, in addition to the other contact term operator $O^{(jj)} + O^{(kk)}$. The new operator arises
from diagrams in which both the quark and antiquark in the two operators in the product
are contracted together, with the closed quark loop then coupling to ${\rm tr} O$ through intermediate
gluons. The presence of this operator shows that the mnemonic of off-shell improvement
through the addition of equations-of-motion operators needs to be generalized beyond the
considerations of earlier work. A more straightforward approach is simply to enumerate the
allowed operators using symmetries. Indeed, one can turn the previous argument around,
and use the fact that symmetry implies the presence of the $f_O$ terms to imply the existence
of the new contact terms.

Finally, we note that these considerations resolve the puzzle concerning the counting of 
improvement coefficients mentioned in the text. The off-diagonal bilinears require one less 
improvement coefficient than the diagonal bilinears. How can this be consistent with the 
fact that a vector transformation rotates the former into the latter (as in the Ward identities 
just discussed)? The answer is provided by the presence of the new contact term, which
allows there to be an $f_O$ term in the diagonal bilinears but not in the off-diagonal ones.